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provides nucleotide sequences encoding the invention chimeric proteins, cells containing such nucleotide sequences, and methods 
Q for using the invention chimeric proteins to modulate expression of one or more exogenous genes in a subject organism. In addition, 
isolated protein crystals suitable for x-ray diffraction analysis and methods for obtaining putative ligands for the invention chimeric 
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HORMOHE RECEPTOR FUniC TrOHAL EMTtTIES 

AWPMn-HOPSOFTHElBUSE 

The present invention relates to methods in the field of recombinant DN A technology, and products related thereto, in a 
particular aspect the invention relates to methods for modulating the expression of exogenous genes in mammalian or non- 
mammaiian systems, and products useful therefor. 

. -5 BACKg B PUWP Qf TH EI HV g HI TtP H 

It is known in the art to produce fusion proteins for a number of purposes. In some cases, the two protein units in 
the single polypeptide have two essentially independent activities. The most common example of this application is the 
fusion of marking proteins, such as 6FP, to intracellular factors as a means of observing their localization and expression 
Isee. for example, A.W. Kerrebrock et aL, Cell, fl3:247-5B, 1995; H.G. Wang et aL Cell, 32:629 638, 1996). Creation of 
10 fusion proteins has also been used to prolong the half -life of a protein {see, for example, R.A. Haiteweli, et aL i Biol Chem., 
2fi4;5260.5268, 1989; T.P. Yao et aL, Cell, 22:6372. 19921 as well as other uses (see, for example. T. Sano etaL Proc 
Natl Acad SciU,SA, 23:1534*1538. 1992). 

A more complicated application of protein fusion is the production of fusion proteins wherein the two protein units 
cooperate to achieve a biological function. In functional dimers. both proteins must fold and interact with each other 

1 5 appropriately. V.A. Garcta Campayo et aL {Nature Biotech, i£:663-667, (1997)) have utilized a peptide linker to fuse gene 
subunits together into a single biologically active peptide. Neuhold and Wotd, {CelL 24:1033-1042, (1 993)) have reported 
the fusion of two proteins into a single biologically active protein that binds DNA targets, wherein the protein units interact 
with each other to the exclusion of competing heterodimer partners. However, fusion of proteins with multiple functions has 
been more difficult to produce, for example, steroidlthyroid hormone nuclear receptors are complex, multifunctional proteins 

20 with, minimally, four interconnected yet separable functions: ligand binding, dimerization, DMA binding, and transactivation. 

Steroid/thyroid homione nuclear receptors are used in the field of genetic engineering as a tool for studying control of 
gene expression and to manipulate and control development and other physiological processes. For example, applications for 
regulated gene expression in mammalian systems include inducible gene targeting, overexpression of toxic and teratogenic genes, 
anti-sense RNA expression, and gene therapy (see, for example, R. Jaenisch, 5c/e/7fe 240:1468-1474, 1988). For cuhured cells. 
25 glucocorticoids and other steroids have been used to induce the expression of a desired gene. 

As another means for controlling gene expression in mammalian systems, an inducible tetracycline regulated system has 
been devised and utilized in transgenic mice, whereby gene activity is induced in the absence of tetracycline and repressed in its 
presence (see, e.g. Gossen ^/ 5/. ^.45 23:5547-5551, 1992; Gossene/5/., 77W 15:471475, 1993; ?unhetaL,Pm 
21:9302-9306. 1994; and Shockett etaL /'/MS 32:6522-6526. 1995). However, disadvantages of the inducible tetracycline 

1 
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system include the requirement for continuous administration of tetracycline to repress expression and the slow clearance of 
antibiotic from bone, a side-effect that interferes with regulation of gene expressioa White this system has been improved by the 
recent identification of a mutant tetracycline repressor that acts conversely as an inducible activator, the pharmacokinetics of 
tetracycline may hinder its use during development when a precise and efficient "on off " switch is essential (see, e.g., Gossen at 
5 aL 5c/;p/7r52fiS:1766'1769. 1995K 

Certain insect steroid/thyroid hormone nuclear receptors have also been studied. The Orasoph//a melanogaster 
ecdysone receptor lEcR) IM. R. Koetle et aL C^//fi2:59*77, 1995) is unlike the estrogen, androgen, and other homodimeric 
vertebrate steroid hormone nuclear receptors because it requires a heterologous dimer partner for functional transactivation. 
The obligate dimer partner, the product of the u/trasp/rach (Usp) gene (V. C. MenrkhataL, Nuc. Acids Res. Ifi: 41434148, 
10 1990; T. P. Yao etaL, supra. 1992; T. P. Yao etaL Nature ^AUAn. 1993K is an insect homolog of the mammalian 
retinoid X receptor (RXR) proteins found in vertebrates and other mammalian species. RXRs have been characterized as 
regulatory dimer partners of many mammalian class 11 steroid/thyroid hormone nuclear receptors, such as the thyroid 
hormone receptors, the retinoic acid receptors, and the vitamin 0 receptor (reviewed in Mangelsdorf and Evans, ZT^// 83:84 1 - 
850, 1995; D. J. Mangelsdorf et at., Cefl^: 835-839, 1995). RXR is also a dimer partner of EcR. 

Usp and RXR share a significant degree of sequence homology and some functional similarities; however, in 
formation of heterodimers with EcR, RXR interacts differently than Usp. One primary difference is that formation of 
EcR I- RXR heterodimers is more highly stimulated by the steroid ligand ecdysteroid muristerone A (murA) than by 20- 
hydroxyecdysone (20'Ec), while formation of EcR-^Usp heterodimers is potently stimulated by 20-hydroxyecdysone (K. 
SXhristopherson etaL Proc Natl Acad Sci U S A S^BZHSm. 1982; H. E.Thomas era/., ter^/e 352:471.475, 1993). 
A second difference is in the way that ligand promotes efficient formation of EcR + Usp and EcR + RXR heterodimer 
complexes and concomitant binding to ecdysone response elements lEcREsl. MurA stimulates EcR +Usp binding of EcREs 
approximately 3 to 7-fold over levels without ligand, but EcR ^ RXR complexes are completely dependent on itgand for 
heterodimerization. Further EcR + RXR complexes bind to EcREs at only 10 40% the level of EcR + Usp complexes 
(Christopherson et aL, supra 1 982; Thomas et aL supra 1 993; Yao et aL supra, 1 992 & 1 993). This suggests that the 
affinity of EcR for its natural dimer partner, Usp, is significantly greater than its affinity for RXR. 

EcR has been studied for use in transgene regulation; however, its use for this purpose is complicated by the 
requirement for superphysiological levels of RXR protein to be coexpressed (No et aL supra 1997), presumably because of 
the comparatively low affinity of EcR for RXR as a dimer partner. Of the mammalian cell types heretofore examined, only 
the 293 cell line appears capable of supporting high level transactivation of EcR without added RXR (Christopherson er^A, 
30 supra, 1 982). The requirement for co-expression of RXR in most mammalian systems raises concerns that RXR will 
heterodimerize with endogenous mammalian class II steroid/thyroid hormone nuclear receptors, causing altered 
differentiation, growth, or fitness of transduced cells. 

2 
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A number of ecdysone receptors are known in the art as being a gene sequence responsive to an applied exogenous 
chemical inducer enabling external control of expression of the gene controlled by the receptor (See, for example, 
PCT/GB96;01195). 

Accordingly, there is a need in the art for improved systems to precisely modulate the expression of exogenous genes in 
5 mammalian subjects. For example, a non-mammalian-hased transCTiption regulating system would be extremely desirable for 
general application to transgene regulation in in vitro, ex vivo, and in vivo applications. In addition, there is a need in the art f or 
new and better methods of using steroid/thyroid hormone nuclear receptors that require a dimer partner for functional 
transactivation of transgene expression for use in somatic gene therapy and for laboratory models thereof. 

BRIEF DESCRIPTIOM OF THE IMVEWimM 

' ^ accordance with the present invention, there are provided chimeric proteins comprising at least two functional 

protein units, wherein each functional protein unit comprises the dimerization domain of a member of the steroid/thyroid 
hormone nuclear receptor superfamily, and an optional linker interposed therebetween, wherein the at least two protein 
units form a functional entity. When the chimeric protein contains two functional protein units, the chimeric protein forms a 
functional dimer (FD), for example a heterodimer or a homodimer. In one embodiment according to the present invention, 

1 5 each protein unit comprises a ligand binding domain and an optional hinge domain of a steroid/thyroid hormone nuclear 
receptor member, and an optional DNA binding domain. The functionality of the entity is independent of the order of the 
protein units in the chimeric protein. Polynucleotides encoding the invention chimeric protein and cells containing such 
polynucieotidefs) are also provided according to the present invention, in one embodiment according to the present 
invention, the invention polynucleotide encodes the invention chimeric protein as a fusion protein, with one or more linker (s) 

20 encoded as a polypeptide linker. 

In accordance with another embodiment of the present invention, there are provided methods for modulating the 
expression of exogenous gene(s) in a subject organism containing DNA constructs) encoding and expressing invention 
chimeric protein(s) and DNA constructis) encoding and expressing exogenous genels) under the control of a response 
element. The invention method for modulating the expression of exogenous genels} in a subject organism comprises 
25 administering to the subject an effective amount of an exogenous ligand for at least one functional unit of the chimeric 
protein. 

The present DNA binding studies indicate that many of the invention functional dimers (FDs) display DNA binding 
equivalent or superior to that of receptor complexes formed from and/or containing identical wild type members of the 
steroid/thyroid hormone nuclear receptor superfamily (i.e„ the same two members from which the invention chimeric protein 
30 is derived). Transient transfection analysis reveals that distinct groups of FD constructs transactivate responsive promoters 
in a manner similar to wild-type complexes, while others lose the capacity to transactivate and function like constitutive 
repressors. 



3NSDOCID; <WO__ 0136447A2 J. 




wo 01/36447 PCT/USOO/41224 

Competition experiments and supporting data reveal that FDs favor dimerization with dimer partners contained 
within a chimeric protein over interaction with other wild type dimer partners. These results demonstrate that certain of the 
invention chimeric protein FDs share properties of monomeric receptor complexes while others have novel characteristics 
unique to individual constructs. 

5 To enhance the possibility of producing a functional entity upon expression, the invention chimeric proteins allow 

for any of the protein units to be positioned at the amino terminus of the chimeric protein, in addition, to enhance flexibility 
for proper folding and three-dimensional orientation of the protein units into a functional entity, an optional linker can be 
interposed between the protein units in the chimeric protein. A variety of different linkers can be used in the invention 
chimeric proteins, including chemical and polypeptide linkers, with the latter being preferred if the entity is expressed as a 

10 fusion protein. In a presently preferred embodiment, the linker is designed to allow for incremental elongation of the linker 
distance interposed between the two protein units. 

BRIEF DESCBIPTIPW OF THE FIGUBES 

In the interests of brevity and consistency, the names of receptors and dimer partners have been abbreviated f or 
use herein as follows: ''E^ U", or 'R' alone indicates a monomeric receptor protein or dimer partner (i.e., not contained in 

1 5 an invention chimeric protein) containing, respectively, at least the iigand binding domain of the Orosophila ecdysone 

receptor, the ultraspiracle protein, or the retinoid X receptor. When contained within an invention "fusion protein", which is 
alternatively referred to herein as a "functional dimer", these receptor proteins are represented by "E*, 'U^ or "R" separated 
by either an "N", representing a linker of any length, or a numeral from 0 to 20, indicative of a linker containing a specific 
number of linker segments wherein each linker segment contains 12 amino acids. In the description of invention fusion 

20 proteins, which are functional dimers, the leading letter in the abbreviation indicates the receptor protein at the amino 
terminus of the fusion protein. For example, E5U means a fusion protein having at least the Iigand binding domain, hinge 
domain, and optionally functional ONA binding domain of DrosopMa ecdysone receptor at the amino terminus, a linker 
containing 5 linker segments (of 12 amino acids each, plus the 5 amino acid linker bridge (i.e., a linker containing a total of 
65 amino acids) and the comparable domains of the ultraspiracle protein. An initial 'V" in the construct abbreviation, e.g., 

25 VE5U or VE, indicates fusion of the VP16 1 activation domain to the N*terminus of the fusion protein, as described more 
completely hereafter. 

Figure 1 is a schematic diagram of an nucleic acid construct encoding invention fusion proteins that contain EcR 
(darkly shaded! with a dimer partner, U (Usp) or R (RXR) (darkly shaded). "D" - DMA binding domain; "L" - Iigand bindi ng 
domain; curvilinear line fusion bridge. "Individual" represents a nucleotide sequence that encodes the wild type C terminus 
30 of EcR (receptor) and the monomeric N terminus of RXR (binding partner) before introduction of a nucleotide sequence 
encoding a fusion bridge. "Fused'' represents the same segments with nucleotides inserted that encode a 5 amino acid 
fusion bridge containing the Sfil insertion site. ''Tether" indicates a nucleotide sequence that encodes a 12 amino acid linker 



\ 
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to be inserted into the Sfil site of the fusion bridge to produce fusion proteins with greater spacing between the two protein 
units (i.e., dimer partners) in the invention fusion protein. 

Figures 2A-B illustrate the results of gel mobility shift assays of response element binding of the invention FD 
constructs having linkers containing either 0 or 5 linker segments as compared with that of monomeric receptor complexes 
(translated in vhroK 

Figure 2A is a graph quantifying gel mobility shift as a result of response element binding to invention endodimer 
FDs in the presence of murA. Controls were treated either with vehicle (open bars) or with 1 \M murA as ligand (black 
bars). Bars are labeled along the bottom with FDs named as described in the text. E represents an EcR only control; NON 
represents a non transfected control. E+U and E+R are control lanes of monomeric w vino translated proteins used for 
sizing of endodimer band shifts. Numbers at the top of each bar represent relative*fold increase in response element binding 
resulting from ligand treatment. 



Figure 2B is a schematic representation of five F-domain deletion constructs containing EcR (darkly shaded) and 
RXR (lightly shaded) with no linker polypeptide (EOR). Incremental deletions are shown to Nhel, Pvull, Marl, and Bglll sites 
within the ecdysone receptor F domain. The top schematic represents EOR (1340 amino acids) containing the complete F- 
1 5 domain (bracketed); the second schematic represents a deletion (A60 amino acids) to the Nhel site; the third schematic 

represents a deletion (A138 amino acids) to the Pvull site; the fourth schematic represents a deletion (A198 amino acids) to 
the Narl site; and the fifth schematic represents a deletion (A22B amino acids) to the Bgtit site. 

Figure 3 is a graph showing relative lucif erase expression induced by FD constructs with or without ligand as 
determined in transient transfection assays for FDs and for monomeric EcR with either Usp or RXR when treated with 
20 vehicle (open bars) or \\M murA. Decimal numbers on the abscissa represent molar amount of FD relative to VE ptasmid 
(1.0 is equimolar FD:VE). EOU and UOE without VE cotransfection are at the extreme right of the bar graph. See also Table 
1. 

Figures 4A-B are two graphs illustrating the results of transient transfection assays conducted using either VP 1 6 
fused monomeric receptors or invention fusion protein FDs with increasing linker lengths. Figure 4A is a graph showing a 

25 comparison of luciferase activity in relative light units (RLU) in transient transfection assays conducted with or without 
ligand, using either monomeric receptors having amino tenninat fused VP16 activation domains or invention FOs containing 
EcR, RXR, and a linker with a variable number of linker segments. Cells were treated with vehicle (open bars), or 1 )iM 
muristerone A as ligand (black bars). Numbers at the top of the bars indicate the fold-increase relative to FD or monomeric 
receptor without addition of monomeric VRXR (VR) or monomeric VUsp (VU). E - EcR only; E4-luc - reporter plasmid only; 

30 and Figure 4B is a graph showing a comparison of luciferase activity as in Figure 4A herein, except that the FDs contain 
Usp in place of RXR. 
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Figure 5 is a series of three graphs showing repression of iigand-stimutated luciierase expression by monomeric 
receptors caused by competition with the invention ENU and UNE FOs when transiently co transfected into 293 cells and 
treated either with vehicle (open bars), or 1 fiM nrturisterone A as ligand (black bars). Decimal numbers on the abscissa 
represent molar amount of FD relative to VE plasmid (1.0 is equimolar FD:VE). Figure 5 A shows a comparison of the 
5 inhibitory effects of EOU and UOE on ligand-stimulated expression of luciferase by monomeric VE in combination with 

endogenous RXR. EQU and UOE without VE cotransfection are at the extreme right of the bar graph; Figure SB shows the 
effects of monomeric EcR (without VP16 fusion) on ligand-stimulated expression of luciferase in the assay of Figure 5A by 
competition with EOU or UOE; and Figure 5C shows the effects of monomeric EcR on luciferase expression in the presence 
of ligand in the assay of Figure 5B, as compared with VE combined with monomeric exogenous Usp. 

10 Figure 6 is a graph showing a comparison of results (in RLU) obtained in assays in which E5U and E5R compete 

with monomeric VRXR (VR) or monomeric VUsp (VUl in 

the presence of vehicle only (open bars) or murA as ligand (black bars). FDs and receptor combinations are labeled along the 
abscissa. Numbers above the bars represent the fold-increase relative to FD or receptor without addition of VR or VU. 
E4LUC at the extreme right is reporter plasmid alone as control. 

1 5 Figures 7 A-E are a series of six schematic diagrams representing possible conformations of receptor FDs 

described in the text. Shaded and white oval/rectangles represent receptors, small rectangles with interior arrows represent 
EcREs* and curvilinear lines represent linkers between protein units in the invention fusion proteins. Figure 7A represents a 
native dimer; Figure 7B represents a disorganized fusion protein; Figure 7C represents an endodimer orientation of a single 
invention FD; Figure 70 represents a tetramer of two invention FDs; Figure 7E represents a multimer of four invention FDs. 

20 Figure 8 is a series of schematic representations of invention FDs containing Bombyx ecdysone receptor (BEcR) 

plus the entire F domain of the Drosophila melanogaster ecdysone receptor (amino acids 650 to 878) (DE), which segment is 
included for ease in making the construct (DEcR). The BEcR is at the amino terminus of the fusion protein with either RXR 
or Usp as the dimer partner at the carhoxy terminus of the fusion protein. "D" - ONA binding domain; "L" - ligand binding 
domain; curvilinear line - fusion bridge. "H* - an N-terminal His tag for protein purification. 

25 

DFTAIIFD DgSCRIPTiOW OF THF iMVEftlTIOH 

In accordance with the present invention, there are provided chimeric proteins comprising at least two functional 
protein units, wherein each functional protein unit comprises the dimerization domain of a member of the steroid/thyroid 
hormone nuclear receptor superfamily, and an optional linker interposed therebetween, wherein the at least two protein 
30 units form a functional entity. W/hen the chimeric protein contains two functional protein units, the chimeric protein forms a 
functional dimer (FD), for example a heterodimer or a homodimer. 
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The invention chimeric proteins form functional entities (e.g. functional dimers) under a variety of conditions. Such 
conditions include, but are not limited to, those at or near physiological conditions (e.g., in saline at body temperature). 
Those of skill in the art will understand that formation of invention functional entities by dimerization or crystallization of a 
macromolecule can be influenced by manipulation of a variety of physical parameters, such as are disclosed in McPherson^ 
5 Eur. J. Biochem., i£9:1 -23, 1 990, which is incorporated herein by reference in its entirety. Due to the proximity of the 

protein units within the invention chimeric protein, dimerization tends to take place with intramolecular partners, rather than 
with other suitable monomeric dimer partners with which the protein units in the chimeric protein might othenA/ise interact. 

As used herein, plural nouns and verbs are intended to signify the singular form as well as the plural form of the 
particular noun or verb, unless prefixed by an adjective indicating a specific number, such as "two feet" or "three tigands", 
10 and a singular noun or verb is intended to include the plural form, unless prefixed by a phrase clearly indicating that only the 
singular noun or verb is intended, as in the phrase "one and only one foot" or "only one ligand." 

Each chimeric protein in the invention system is required to contain a dimerization domain of a member of the 
steroid/thyroid hormone nuclear receptor superfamily. As used herein, "dimerization domain" means a region of a member of 
the steroid/thyroid hormone nuclear receptor superfamily containing a sequence of amino acids that functions to cause 
dimerization of two members of the steroid/thyroid hormone nuclear receptor superfamily. Members of the steroid/thyroid 
homione nuclear receptor superfamily are commonly characterized by the presence of five domains: N-temiinal or activation 
domain (A/B), DNA binding domain (C), hinge domain (D), ligand binding domain (E), and C-temiinal domain (F) (Evans, R. Science 
240:889-895, 1988). The dimerization domain is generally located within the region of the receptor molecule that is referred 
to as including the D, E and F domains, or is referred to as the "D-E-F* domain. Typically the dimerization domain includes 
the complete ligand binding domain (E) and may optionally include all or part of the hinge domain (D) and/or the C terminal 
region (F) of a member of the steroid/thyroid nuclear receptor superfamily, or a functional equivalent thereof. In some cases 
the dimerization domain may include at least a portion of the DNA binding domain itself. Multiple domains of a given 
receptor can act in concert as well as independently. Therefore, as employed herein, the term "dimerization domain of a 
member of the steroid/thyroid homione nuclear receptor superfamily" refers to that portion lor portions) of a member of the 
steroid/thyroid hormone nuclear receptor superfamily that is involved in the formation of a dimer. 

As used herein, the term "fusion protein" means a genetically engineered molecule in which two or more ' 
polypeptide units are fused into a single polypeptide molecule by fusion of the open reading frames {ORFs) encoding the two 
or more separate protein units into a single OFF. The invention fusion proteins are capable of forming a "functional entity" 
in the optional presence of ligand. When the fusion proteins contain two protein units, a "functional dimer (FD)" is formed by 
30 dimerization. 

As used herein, the term "functional dimer" or "functional entity" as applied to an invention chimeric protein means 
that the functional entity or dimer possesses at least some of the biological function of a dimer formed between two 
equivalent monomeric li.e. undimerized) polypeptide units, or between two equivalent monomeric members of the 
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Steroid/thyroid hormone nuclear receptor superfamily. The biological function of such dimers includes one or more of the 
following properties: ONA binding, ligand binding, transactivation. and dimerization properties related to transactivation of a 
promoter operatively associated with a response element responsive to the invention chimeric protein. For example, 
invention chimeric protein(s) can modulate transactivation of gene(s) whose expression is controlled by the presence of ligand 
(e.g. an invention FD wherein at least one member is a Bombyxmori ecdysone receptor can modulate the expression of a gene 
under the control of a Bombyx ecdysone response element). 

Therefore, the term "functional protein units'* as applied to the functional entity or dimer formed by an invention 
chimeric protein means that the at least two protein units in the functional entity or dimer possess a cooperative function. 
For example, in a functional dimer the two dimerization domains (e.g., the two protein units) fold and interact with each 
other in a manner appropriate to substantially preserve one or more of the above named biological functions in the functional 
dimer that are present when corresponding monomeric members of the dimer come together under physiological conditions 
to form a native dimer complex. 

As used herein the term "endodimer" means a dimer formed in an orientation approximating that of a native dimer 
complex formed between equivalent monomeric polypeptides, i.e.. an "internar dimer. Figure 7A illustrates a native dimer 
complex and Figure 7C illustrates an invention endodimer. 

As used herein the term "dimer partner" means any polypeptide that, under physiological conditions, forms a dimer 
with a member of the steroid/thyroid hormone nuclear receptor superfamily. Such dimer partners include, but are not limited 
to, monomeric member(s) of the steroid/thyroid hormone nuclear receptor superfamily, including those known in the art as a 
"silent partner,*" which are characterized by forming dimeric species with a member of the steroid/thyroid superfamily of receptors 
wherein the silent partner may not directly participate in binding ligand (i.e., only the co-partner in the fusion protein binds ligand). 
Exemplary dimer partner(s) include RXR, Usp, Nurrl, and the like. 

The term "dimer partner" is meant to include members of the steroid/thyroid hormone nuclear receptor superfamily to 
which other wild type members preferentially bind to f onm heterodimeric species. For example, wild type members of the 
steroid/thyroid hormone nuclear receptor superfamily preferentially form heterodimers with a common partner, the retinoid X (or 
9 cis retinoic acid) receptor (RXR, see, for example, Yu etaL Cell, B2:125M266, 1991; Bugge et aL EMBOJ., 11:1409141 8, 
1992; Kliewer etaL /IfeftfA^ 355:446-449, 1992; Leid etaL re//fifl:377-395, 1992; Marks etaL EMBOJ. 11:14191435. 
1992; Zhang ^r^t/l/artfrff 555:441 -446, 1992; lssemann£/5/.,^/i?rA/j7?/p, 25:251-256, 1993). Additional dimer partners for 
members of the steroid/thyroid hormone nuclear receptor superfamily include ultraspiracle (Usp), famesoid X receptor (FXR), and 
the tike. 

As used herein, the phrase "member(s) of the steroid/thyroid hormone nuclear receptor superfamily" (also known as 
"intracelluiar receptors" or "the nuclear receptor superfamily") refers to homione binding proteins that operate as ligand- 
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dependent transcription factors, including identified members of the steroid/thyroid hormone nuclear receptor superfamily for 
which specific liganris have not yet been identified (referred to in the art as "orphan receptors"). 

Exemplary members of the steroid/thyroid hormone superfamily of receptors (including the various isof orms 
thereof) include steroid receptors such as glucocorticoid receptor (GR), mineralocorticoid receptor (MRl, estrogen receptor 
5 (ERL progesterone receptor (PR), androgen receptor (AR). vitamin Dj receptor (VDR), and the like; plus retinoid receptors, 
such as the various isof orms of retinoic acid receptor (e.o., RAR , RAR or RAR ), the various isof orms of retinoid X (or Sc/s 
retinoic acid) receptor (e.Q., RXR , RXR , or RXR ), various isoforms of peroxisome proliferator activated receptors le,g., 
PPAR , PPAR , PPAR ) and the like (see, e.g. U,S, Patent Nos. 4,981784; 5,171,671; and 5,071 ,773); thyroid hormone 
receptor (T3R), such as TR , TR . and the like; steroid and xenobiotic receptor (SXR, see for example. Blumberg et al., Ge/jes 

10 /7ev (1998) 121201:3195-205). RXR interacting proteins (RIPs; see, e.g., Seo) et al..)J^(?/f/?/tor/7>7^/(1995)ailJ:72-85; 
Zavacki et al., Proc Natl Acad Sci USA (1997l24tl51:7909-14) including farnesoid X receptor (FXR; see for example, 
Forman et al., Cell (1 995) Mi5):687-93; Hanley et al., J Clin Invest 1 1 997) lflQI3):705- 1 2, O'Brien et al.. Carcinogenesis 
(1996) 1212): 185-90). pregnenolone X receptor (PXR; see for example. Schuetz et al.. Mol Pharmacol W^^Vi 5^:1 1 13- 7). 
liver X receptor (LXR, see, e.g.. Peet et at., Curr Opin Genet Dev (1998) flI5i:571-5), BXR (Blumberg et al.. Genes Oev (1998) 

1 5 1 2|9):1 269-77), insect derived receptors such as the ecdysone receptor (EcR), the ultraspiracle receptor (see, for example^ 
Oro et al.. in Nature ^:298 301 (1990)), and the like; as well as other gene products which, by their structure and 
properties, are considered to be members of the superfamily, as defined hereinabove, including tfie various isoforms thereof 
(see, e.g.. Laudet. V.. J MoJ Endocrinol im7)ia!31:2m'2Bl 

Examples of orphan receptors contemplated for use herein include HNF4 (see. for example, Stadek et al.. Genes S 
20 Development 4:2353-2365 (1990)). the COUP family of receptors (see. for example, fi/liyajima et al.. in Nucleic Acids 
Research I£:1 1057- 1 1074 (1988). and Wang et al. Nature MQ:163-166 (1989)). COUP-like receptors and COUP 
homologs. such as those described by MIodziket a).. /r^//£D:21 1-224 (1990) and Ladias et al., 5c/£y7Cf 2^:561-565 
(1991), orphan receptor (ORl; see, e.g.. Feltkamp et zl.JBiolChem (1999) 2Z4il5J: 1042 1-9). the insect derived knirps and 
knirps-related receptors, short heterodimer partner (SHP; see, e.g., Seol et al.. Mol Cell Biol 111121:71 26-31), 
25 hepatocyte nuclear receptor 4 (HNF4). constitutive androstane receptor (CAR; see, e.g.. Forman et al.. Nature (1998) 
335162021:61 2-5), and the like. 

Each protein unit in the invention chimeric protein is required to contain at least a dimerization domain, optionally, the 
entire ligand binding domain, an optional hinge domain, and an optionally functional DNA binding domain of a member of the 
steroid/thyroid nuclear receptor superfamily. or a functional equivalent thereof. For use in the invention methods for 
30 modulating the transcription of exogenous or endogenous nucleic acids in a host, the ligand binding domains are either endogenous 
or non-endogenous to a host, with the latter including ligand binding domains that are modified to be non-responsive to iigands 
endogenous or native to the host, in embodiments wherein the ligand binding domain is derived from non mammaltan member(s) 
of the steroid/thyroid hormone nuclear receptor superfamily. which members are not nomnally present in the cells of a host, the 
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iigand binding domains are preferably derived from the carboxy-terminat portion of non-mammalian members. Exemplary members 
that are not normally present in mammalian ceils include insect, avian, amphibian, reptilian, fish, plant, bacteria, viral and fungal 
{including yeast) m^nfaers of the steroid/thyroid hormone nuclear receptor superf amily, and the tike. 

Exemplary Iigand binding domains derived from insect receptors include those derived from lepidopteran species such as 
5 Drosophila melanogaster {M.R. Koelle, 1 995), Bombyx mori (Swevers et aL, insect Biochem, Molec, Biol, 25(71:857-86S, 1 9951, 
Choristoneura fumlferana (Palli at aL insect Biociiem. Afoiec. BioL, 2&(5):485*499, 1 996), i^anduca sexta (Fujiwara Bt ai.. Insect 
Biocitem, ii/loiec. Biol., 25(7):845'856, ]9aSiAedesaeg^tHChQetai.JnsectB/ocitemi^oiec,Bfo/., 25:19-27, 1995), 
Citorinomus tentans {Irvhoi et ai,Jnsect Biodtem. i^foiec. Biot., 25:115-124, 1993), and the like. 

When the functional protein units included in the invention chimeric protein lack a substantial portion of the C-termina) 
1 0 7" domain in the dimerization domain of a native member of the steroid/thyroid hormone nuclear receptor superfamily, a 
functional protein unit that is less than about 700 amino acids in length is provided. 



Ligand binding domains can be functionally located in either orientation and at various positions within the protein unit. 
For example, the tigand binding domain can be positioned at either the amino or carboxy temiinus of the protein unit in the 
1 5 invention chimeric protein, or therebetween. In a preferred embodiment of the present invention, the ligand binding domain is 
positioned at the carboxy terminus of the protein unit (see Figure 1). 

The optional hinge region, when present, can also be functionally located in either orientation and at various positions 
within the protein unit. For example, the hinge region can be positioned at either the amino or carboxy terminus of the protein 
unit, or therebetween. Preferably, the hinge region is positioned internally between the tigand binding and DMA binding domains 
20 of one or more of the members in the chimeric protein. The hinge region bounded by the ligand binding domain and DMA binding 
domain of the native Bombyx mori s^z^^m (BEcRI, specifically, about 27 amino acid residues (i.e. amino acid residues 283-309. 
in the hinge region of BEcR) are sufficient to confer high affinity for complex fomiation with an endogenous dimer partner <5ee 
U.S. Patent Application Serial No. 08/89 1 ,298, filed July 1 0, 1 997, copending herewith). 

Each protein unit in the invention chimeric protein also optionally contains a DMA binding-domain. DMA-binding domains 
25 contemplated for use in the preparation of invention chimeric proteins are well known in the art and are typically obtained from 
DNA-binding proteins (e.g., transcription factors). The tenn "DMA-binding domain" is understood in the art to refer to an amino 
acid sequence that is able to bind to DNA. As used herein, the term "DNA-binding domain" encompasses a minimal peptide 
sequence of a ONA-binding protein up to the entire length of a DMA-binding protein, so long as the DNA binding domain functions 
to associate with a particular regulatory element. 

30 DMA-binding domains are known to function heterologously in combination with other functional domains by 

maintaining the ability to bind the natural DMA recognition sequence (see, e.g.. Brent and Ptashne, Cell, 43:729 736, 1985). For 
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example, with respect to steroid/thyroid hormone nuclear receptors, ONA-bimling domains are interchangeable, thereby providing 
numerous chimeric receptor proteins (see. e.g., U.S. Patent 4,981,784; and R. Evans, Sdence, 249:889'895, 1988). Similar to 
the iigand binding domain, the DNA-binding domain can be positioned at either the carboxy terminus or the amino terminus of a 
protein unit in the invention chimeric protein, or the ONA-binding domain can be positioned between the iigand binding domain and 
5 the activation domair). in preferred embodiments of the present invention, the DNA-binding domain is positioned internally 
between the Iigand binding domain and the activation domain. 

'ONA-binding proteinfs)" contemplated for use herein belong to the well-known class of proteins that are able to directly 
bind DMA and facilitate initiation or repression of transcription. Exemplary ONA-binding proteins contemplated for use herein 
include transcription control proteins (e.g., transcription factors and the like; see, for example, Conaway and Conaway, 
1 0 Transcription Mechanisms andHegti/atm Raven Press Series on Molecular and Cellular Biology, Vol. 3, Raven Press, Ltd., New 
York,NYJ994l 

Transcription factors contemplated for use herein as a source of such ONA binding domains include, e.o., homeobox 
proteins, zinc finger proteins, hormone receptors, helix-turn-helix proteins, helix-loop-hetix proteins, basic-Zip proteins (bZipL 
ribbon factors, and the like. See, for example, S. Harrison, *A Stnjctural Taxonomy of DMA-binding Domains," tore, 353:7 1 5- 

15 719. Homeobox DNA-binding proteins suitable for use herein include, for example, HOX, STF-1 (Leonard et aL MoL Endo,, 

2:1275-1283, 1993), Antp, Mat a-2, INV, and the like. See, also. Scott atai Biocham. Biophys, Acta, 389:2548. 1989. It has 
been found that a fragment of 76 amino acids (corresponding to amino acids 140*215 described in Leonard f/^/., 1993) 
containing the STF-1 homeodomain binds DNA as tightly as wild-type STF-1. Suitable zinc finger DNA-binding proteins for use 
herein include Zif268, Gil, XFin, and the like. See also, Klug and Rhodes, Trends Biocham, Sci., 12:464, 1987; Jacobs and 

20 Michaels, yi^^w^ftb/, 2:583. 1990; and Jacobs. fAf5Z7a,il:4507-4517, 1992. 

An additional DNA binding domain contemplated for use in the practice of the present invention is the GAL4 DNA 
binding domain. The DNA binding domain of the yeast GAL4 protein comprises at least the first 74 amino terminal amino acids 
thereof (see, for example, Keeganf/^/., ;?c/f/7Cf 221:699-704, 1986). Preferably, the first 90 or more amino terminal amino 
acids of the GAL4 protein will be used, for example, the 147 amino terminal amino acid residues of yeast GAL4 . 

25 The DNA-binding domain(s) used in the invention chimeric proteins can be obtained from a member of the steroid/thyroid 

homione nuclear receptor superf amily, or are substantially the same as those obtained from a member of the superf amily. The 
DNA-binding domains of all members of the steroidlthyroid homione nuclear receptor superf amity are related. Such domains 
consist of 66-68 amino acid residues, and possess about 20 invariant amino add residues, including nine cysteines. Members of 
the superfamily are characterized as proteins which contain these 20 invariant amino acid residues. The highly conserved amino 

30 acids of the ONA-binding domain of members of the superfamily are as follows: 

Cys-X.XCys-X-X.Asp*.X.Ala*-X.GIy*-X-Tyr**X-X.X-X-Cys-X.X- 
Cys.Lys' X.Phe.Phe.X-Arg--X.X-X-X*X.(X-X-l Cys-X-X-X- X- X.(X. 
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X.X ICys.X.X.X.Lys X.X.Afg.X.X Cvs.X.X.Cys Arg- X X. 
Lys* • Cys . X - X • X . Gly' • Met (SEQ ID N0:1): 

wherein X designates non-conserved amino acids within the ONA binding domain; an asterisk denotes the amino acid residues 
which are almost universally conserved, but for which variations have been found in some identified hormone receptors; and the 
residues enclosed in parenthesis are optional residues (thus, the DMA-binding domain is a minimum of 66 amino acids in length, 
but can contain several additional residues). 

Invention chimeric proteins are optionally modified by the introduction of an activation domain subunit. Acthration 
domains contemplated for use in the practice of the present invention are well known in the art and can readily be identified by 
those of skill in the art. Such activation domains are typically derived from transcription factors and comprise a contiguous 
sequence that functions to activate gene expression when associated with a suitable ON A binding domain and a suitable ligand 
binding domain. An activation domain can be positioned at any convenient site within the invention chimeric proteia e. g.. at the 
carboxy terminus, the amino terminus, or between the tigand binding domain and the DNA binding domain within one or both 
protein units of the chimeric protein. In presently preferred embodiments of the invention, the activation domain is positioned at 
the amino tenninus of the invention chimeric protein. 

Suitable activation domains can be obtained from a variety of sources* e.g., from the N-terminal region of members of 
the steroid/thyroid hormone nuclear receptor superf amily, from transcription factor activation domains, such as, for example, 
VP16, 6AL4, NF kB or BP64 acthfation domains, and the like. The activation domain presently preferred for use in the practice of 
the present invention is obtained from the C-terminal region of the VP1 6 protein, and is known as VP 1 6i. 

in a presently preferred embodiment of the present invention, chimeric proteins contain one or more ecdysone 
receptors lEcR) as the steroid/thyroid hormone nuclear receptor, for example, a Drosophila EcR (DEcR) or a Bombyx EcR 
(BEcR). The chimeric protein further comprises either RXR or ultraspiracle protein (Usp) as an additional functional protein 
unit. The preferred order within the chimeric protein is for the EcR to be located at the amino terminus of the chimeric 
protein. However, when the invention chimeric protein further comprises an activation domain, the activation domain is 
preferably located at the amino terminus of the chimeric protein. 

The EcR. an insect receptor, differs in two respects from other known steroid/thyroid hormone nuclear receptor 
superfamily. First, EcR has very different documented relationships with two similar dimer partners: its natural partner, 
Usp, and the mammalian homolog, RXR. EcR also differs from other members of the steroid/thyroid hormone nuclear 
receptor superfamily in that its apparent affinity to these heterodimer partners varies depending on the presence of ligand . 

To facilitate dimerization of the dimerization domains in the invention chimeric proteins, the at least two protein 
units of the chimeric protein preferably have the ligand binding domain, hinge domain, and DNA binding domain in the same 
order within each protein unit. If the chimeric protein additionally contains an activation domain, the activation domain is 
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preferably located at the amino terminus of the chimeric protein, ahead of the first unit thereof, as illustrated in Examples 1 
and 4 herein. 

Invention chimeric protein(s) optionally further contain a linker interposed between one or more of the protein units. 
The protein units can be independently oriented amino terminus to carboxy terminus within the chimeric protein, or visa 
5 versa . For example, the linker can be placed between the carboxy terminus of the first protein unit and the amino terminus 
of the second protein unit. Any type of linker known in the art can be used for linking the protein units in invention chimeric 
proteins so long as the linker is flexible and does not interfere with dimeriiation between protein units in the invention 
chimeric proteins. 

In one embodiment according to the present invention, the linker is a heterobifunctionai cieavable cross-linker, such 
1 0 as N-succinimidy) (4-iodoacetyl) aminobenzoate; sulfosuccinimidyl{4'iodoacetyl)'aminoben2oate; 4-succinimidyi-oxycarbonYl- 
a (2-pyridyldithio) toluene ; sulfosuccinimidyt-6-|a'methyl-a*(pyridyldithiol)-toluamidol hexanoate; N'Succinimidyl'3'(-2- 
pyridyldithio)proprionate;succinimidy)-6|3l-(-2-pyridyldithio)-proprionamido]hexanoate; sulfosuccinimidyl-6|3M-2- 
pyridyldithtol-propianamido) hexanoate; 3 (2-pyridyidithio)-propionyl hydrazide. Eltman's reagent, dichlorotriazinic acid, S-(2- 
thiopyridyD-bcysteine, and the like. Further bifunctional linking compounds are disclosed in U.S. Patent Nos. 5,349,06 6* 
1 5 5,618,528, 4,569,789, 4,952,394, and 5,1 37,877, each of which is incorporated herein by reference in its entirety. These 
chemical linkers can be attached to purified proteins using numerous protocols known in the art, such as those described in 
Pierce Chemicals 'Solutions, Cross-linking of Proteins: Basic Concepts and Strategies," Seminar #12, Rockford, IL 

in another embodiment according to the present invention, the linker can be a peptide having from about 2 to about 
60 amino acid residues, for example from about 5 to about 40, or from about 10 to about 30 amino acid residues, such as is 

20 known in single-chain antibody research. Examples of such known linker moieties include G66GS (SEQ ID N0:2), (GG6GSI„ 
(SEQ. ID N0:3), GKSSGSGSESKS (SEQ ID N0:4), GSTSGSGKSSEGKG (SEQ, ID N0:5), GSTSGSGKSSEGSGSTKG (SEO ID 
N0:6h GSTSGSGKSSEGKG {SEQ ID N0:7), GSTSGSGKPGSGEGSTKG (SEQ ID N0:8I, EGKSSGSGSESKEF (SED ID W0:9), 
SRSS6 (SEQ. ID f\(0:10), SGSSC (SEQ ID N0:1 1), and the like. A Diphtheria toxin trypsin sensitive linker having the 
sequence AMGRS6GGCAGNRVGSSLSC6GLNLQAM (SEQ ID N0:12) is also useful. Alternatively, the peptide linker moiety 

25 can be VM or AM, or have the structure described by the formula: AMtGj ,o 4S),AM wherein X is an integer from 1 to 11 (SEQ 
ID NQ:13). Additional linking moieties are described, for example, in Huston et aL /ViM^ £5:58 79-5883, 1988; Whitlow, 
f^.. at at., Protem f/fgweermg iM9'B95, 1993; Newton 5/, tf/^^^^mr/y 25:545-553, 1996; A.J. Cumber 5/., 
B/oconJ, Chem. 3:397-401, 1992; Ladurner ef 5/., J. MoL BioL 223:330-337, 1997; and U.S. Patent. No. 4,894,443, tfie 
latter of which is incorporated herein by reference in its entirety. 

30 Generally, however, the linker contains from about 5 to about 245 amino acids, althougfi there is no theoretical 

upper limit on the number of amino acids that could be used in the linker. Preferably, the linker contains from about 53 to 
about 125 amino acids. The amino acids in the linker protein are preferably selected to provide flexibility to the linker. 
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Preferably, a multiplicity of flexibility enhancing amino acids, such as proline, glycine, alanine and serine, are incorporated 
into the linker to enhance its flexibility. 

Assuming a span of approximately 3.35 angstroms per amino acid within the flexible peptide bridge encoded by a 
36 base pair Sfil compatible oligonucleotide, the predicted minimum and maximum distance for the lengths of the linker 
5 having from 0 to 20 linker segments ranges from about 16.75 angstroms (the 5 amino acid bridge) to 804 angstroms 120- 
linker segments + the 5 amino acid bridge). Thus, the length of the linker can readily be selected to enhance dimeri2ation 
between any two particular members acting as dimer partners by including as many linker segments as is preferred to 
enhance the biological functions of the functional dimer, as discussed herein. 

In a presently preferred embodiment, the nucleotide encoding the polypeptide linker contains a restriction 
1 0 endonuctease recognition site which produces an overhang composed of non palindromic center bases to allow for insertion 
of compatible inserts in a uniform orientation and in continually in-frama blocks along the length of the polypeptide linker. 
This type of linker allows incremental expansion of the linker peptide to produce chimeric proteins containing linkers with a 
range of distances between the protein units. The nucleotide encoding the tinker preferably contains a rare 8-base-pair Sfil 
recognition site that is useful in making constnicts with tinkers of variable length. In addition, the nucleotides composing the 
1 5 recognition site: 

GGCCNNNNNGGCC- (SEQ ID N0:t4) 

are guanidines and cytosines, which can be oriented in frame to encode glycine and proline residues in accordance with the 
criteria of producing a ''f lexibte" protein linker for junction of the two units of the chimeric protein. Any bases can be used 
as the "M" nucleotides contained within the recognition site, allowing further flexibility in the design of the linker. 

20 A presently preferred linker amino acid sequence is GP6GGSGGGSGT (SEQ ID N0:15), which provides a high 

degree of predicted flexibility while minimizing repetitive sequence within the encoding oligonucleotide. 

in accordance with another embodiment of the present invention, there are provided nucleotides encoding invention 
chimeric protein(s) and cells containing such nucleotides. Ceils containing invention polynucleotides can be either mammalian 
or non mammaiian, for example, plant or fungi ceils, and the like. 

25 In accordance with another embodiment of the present invention, there are provided methods for modulating th e 

expression of exogenous genets) in a subject organism containing: 

1 ) a functional entity according to the invention and 

2) a DMA construct encoding and expressing the exogenous gene(s) under the control of a 
response element responsive to the functional dimer. 
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Invention methods for modulating exogenous gene expression in such a subject organism comprise administering to the 
subject organism an effective amount of at least one ligand for the functional dimer. 

Ugand is selected to activate at least one functional unit of the functional entity. When the f unctionai entity is a 
functional dimer, the ligand is usually selected to activate the dominant member of the functional dimer. For example, if one of 
5 the two members in the functional dimer is a silent dimer partner, the ligand is selected to activate the member in the functional 
dimer that does not act as a silent dimer partner. 

In a presently preferred embodiment of the present invention, one of the members in the functional entity is from an 
insect species, and the preferred ligand is an insect hormone. For example, preferred insect receptors are the Drosophila 
ecdysone receptor or the Bombyx ecdysone receptor, and the preferred dimer partner with these insect receptors in the invention 
1 0 chimeric protein is either the uhraspiracle protein or a retinoid X receptor. These functional entities complex with the ecdysone 
response element, generally in the presence of ligand for the functional dimer formed by the invention chimeric protein, but in 
some instances the presence of ligand is not required to f onn a functional entity/response element complex, as explained more 
fully hereinbetow. 

As employed herein, the terms "modulate" and "modulating" refer to the ability of a given functional entity to 
1 5 activate/deactivate and/or up-regulate/down*reguiate transcription of exogenous nucleic acids, relative to the transactivation 
activity in the absence of the functional entity. 

The actual effect of an invention functional entity on the transcription of exogenous or endogenous nucleic acids will 
vary depending on the particular combination of dtmerization domains and/or ntembers of the steroid/thyroid hormone nuclear 
receptor superf amily in the chimeric proteia on the presence or absence of specific ligand for the ligand binding domainis) 

20 employed in the chimeric proteia and on the regulatory element (e.g., response element) with which the selected chimeric protein 
interacts. It is specifically contemplated within the scope of the present invention that modulation includes repression of 
expression of one or more genes. Such repression can be either ligand-dependent repression or repression that occurs 
independently of the presence of a ligand. Thus, there are four types of modulation contemplated within the scope of the 
invention: ligand-dependent induced modulation, ligand-dependent repressed modulation, ligand independent induced modulation. 

25 and ligand independent repressed modulation. The ligand can be either exogenous or endogenous to the subject treated for 
modulation of expression of an exogenous gene. 

More particularly, the type of modulation that results from the practice of the invention method (i.e.. whether activation 
or repression of expression of the exogenous gene) depends upon the combination of dimerization domains andlor members of the 
steroid/thyroid hormone nuclear receptor superf amily contained within a functional entity formed by the invention chimeric 
30 protein. For example, it has been detemiined that activation of expression of the exogenous gene(s) according to the invention 
modulation method can be achieved if the dimer partners in an invention FD used in the invention method of modulation are an 
ecdysone receptor and a retinoid X receptor. e.g- EOR or E5R. Ligands suitable for acth/ating expression of the exogenous gene(s) 
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when such functional dimers are employed in the invention methods include muristerone A, 20 hydroxyBcdysonB, 
phytDecdysteroid(sl and the like. 

On the other hand, it has been determined that expression of an exogenous genels) can be repressed independently of 
(i.e., with or without) the presence of ligand if an invention FD comprising an ecdysone receptor (e.g.. either a Drosophila 
5 ecdysone receptor or a Bombyx ecdysone receptor) and an ultraspiracia protein as dimer partner is used in the invention method. 
e.g., E5U. and the like. Ugands suitable for acth/ating expression of the exogenous genels) when such functional dimers are 
employed in the invention methods are 20-hydroxyecdysone, muristerone A. phytoecdysteroidls), and the like. In addition, 
expression of exogenous genels) can be repressed independently of (i.e., with or without) the presence of ligand if an invention FD 
comprising a Bombyx ecdysone receptor and an RXR as dimer partner is used in the invention method. As shown in Example 6 
1 0 herein, an N-terminal His tag on the chimeric protein to aid in protein purification does not effect binding of the chimeric protein to 
a suitable response element so as to repress expression of the exogenous gene according to invention methods for modulating 
expression of exogenous genels). 

Accordingly, in another embodiment of the present invention, there are provided methods for modulating (i.e., either 
activating or repressing) the expression of one or more genes in a subject organism independently of the presence of ligand for 

1 5 the invention chimeric protein. If the subject organism contains an invention chimeric protein, the invention method 

comprises introducing to the subject an exogenous response elementis) with which the chimeric protein interacts and which 
controls expression of the one or more genes, thereby modulating expression of the genels) independent of the presence of 
ligand for the chimeric protein. Qn the other hand, if the subject organism contains an exogenous response elementis) 
controlling expression of the one or more genes, the invention method comprises introducing to the subject an invention 

20 chimeric protein with which the response element interacts, thereby modulating expression of the genels) independent of the 
presence of ligand for the chimeric protein. 

in accordance with another embodiment of the present invention, there are provided methods for modulating {i.e.. either 
activating or repressing) the expression of one or more exogenous genes independent of ligand tor the chimeric protein. It the 
subject contains a chimeric protein according to the invention,, the invention method comprises introducing to the subject an 
25 effective amount of a response element, wherein the response element is responsive to the chimeric protein and wherein the 
modulation is independent of ligand for the chimeric protein. The modulation can be ligand independent activation or iigand 
independent repression. 

In accordance with another embodiment of the present invention, there are provided methods for modulating {i.e., either 
activating or repressing) the expression of one or more exogenous genes in a cell containing: 

30 1) an invention chimeric protein and 

2) a DNA construct comprising the exogenous gene under the control of a response element with 
which the chimeric protein interacts, wherein said response element controls expression of the exogenous gene^ 
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said method cotnpnsing administering to the ceil an effective amount of an exogenous ligand for at least one 
functional unit of the chimeric protein. 

In accordance with another embodiment of the present inventtoa there are provided methods for modulating the 
expression of one or more genes in a subject organism containing an endogenous response element controlling expression of 
5 one or more genes. The invention method in this situation comprises introducing to the subject an invention chimeric protein, 
wherein the chimeric protein interacts with the response element, thereby modulating expression of the genels) dependent 
on the presence of endogenous ligand therefor. The chimeric protein is encoded by an inducible DMA construct and the 
modulating comprises inducing expression of the genelst. 

In another embodiment according to the present invention, there are provided methods for modulating the 
1 0 expression of one or more genes in a subject organism containing an endogenous response element controlling expression of 
one or more genes and an endogenous ligand. The invention method comprises introducing to the subject an invention 
chimeric protein that interacts with the endogenous ligand and wherein the chimeric protein interacts with the response 
element, thereby modulating expression of the genels) dependent on the presence of the endogenous ligand. If the invention 
chimeric protein is encoded by an inducible DNA construct, the modulating further comprises inducing expression of the 
1 5 chimeric protein. This embodiment of the invention is especially useful for controlling expression of an exogenous gene that is 
under the control of an endogenous response element wherein the ligand for the invention functional dimer is also endogenous. 

Response elements contemplated for use in the practice of the present invention (relating to modulation of the 
expression of exogenous genes in a subject) include native, as well as modified response elements. For example, since invention 
functional dimers can function as either homodimers or as heterodimers (with a silent partner therefor), any response element that 
20 is responsive to an invention functional dimer. in the form of a homodimer or heterodimer, is contemplated for use in the invention 
methods described herein. As is readily recognized by those of skill in the art. invention functional dimers (whether in the form of 
a homodimer or a heterodimer) can bind to a response element having an inverted repeat motif (t.e., two or more half sites rn 
mirror image orientation with respect to one another), to a response element having a direct repeat motif, and the like. 

Response elements useful in conjunction with invention functional entities are those well known in the art. As readily 
25 recognized by those of skill in the art. the response element employed will vary as a function of the protein units incorporated 
into the functional entity. Thus, for example, retinoic acid receptor response elements are composed of at least one direct 
repeat of two or more defined half sites separated by a spacer of five nucleotides. The spacer nucleotides can independently be 
selected from any one of A. C. 6 or T. Each half site of response elements contemplated for use in the practice of the invention 
comprises the sequence: 

30 RGBNNM-. 
wherein 

R is selected from A or G; 
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B is selected from G, C, or T; 

each N is independently selected from A, T, C, or G; and 

M is selected from A or C; 

with the proviso that at least 4 nucleotides of said .RGBNNM. sequence are identical with the nucleotides at corresponding 
positions of the sequence -AGGTCA-. Response elements employed in the practice of the present invention can optionally be 
preceded by N„ wherein x falls in the range of 0 up to 5. 

For example, thyroid hormone receptor response elements can be composed of the same half site repeats, with a 
spacer of four nucleotides. Alternatively, palindromic constructs as have been described in the art are also functional as TR 
response elements. 

Exemplary GAL4 response elements are those containing the palindromic 1 7-mer: 

B'-CGGAGGACTGTCCTCCG 3' (SEQ ID N0:16L 

such as. for example, 1 7MX, as described by Webster et aU in Cell 52:1 69-178 (1988), as well as derivatives thereof. 
Additional examples of suitable response elements include those described by Hollenberg and Evans in Cell 55:899-906 (1988); 
or Webster et al. in Cell 54:199-207 (1988). 

Ecdysone response element sequences are preferred for use herein with functional dimers containing an ecdysone 
receptor function in a position- and orientation-independent fashion. The native ecdysone response element has been previously 
described, see, e.g., Yao et al., Cell, 21:63-72, 1992. 

In the invention methods the operative response element is functionally linked to an operative exogenous gene(s) 
whose expression it is desirable to control. The word "operative" means that the respective DMA sequences (represented by 
the terms "response element" and "exogenous or endogenous gene") are operational i.e., work for their intended purposes; the 
word "functionally" means that after the two segments are linked, upon appropriate activation by a functional dimer/ligand 
complex, the exogenous oene(s) will be expressed as the result of the fact that the "response element" was "turned on" or 
otherwise activated. 

Certain nucleic acid constructs contemplated for use in one aspect of the present invention include promoters and 
regulatory elements operatively associated with exogenous nucleic acids. In one embodiment of the present invention, the 
invention functional dimer, in the presence of a ligand therefor, binds the regulatory element and activates transcription of one or 
more exogenous nucleic acids. For example, an invention functional dimer containing the protein units RXR and EcR will 
transactivate an ecdysone response element-containing promoter in the presence of the hormone ecdysone. or the synthetic 
analog, murtsterone A. 
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Regulatory elements contemplated for use in the practice of the present invention include elements responsive to the 
invention receptor peptide. In a preferred embodiment of the present invention, such elements are exogenous regulatory elements 
not normally present in the cells of the host. One class of exogenous regulatory elements contemplated for use herein includes 
homtone response elements that modulate transcription of exogenous nucleic acid when bound to the DNA binding domain of an 
5 invention receptor peptide. 

Regulatory elements employed in the practice of the present invention are operably linked to a suitable promoter for 
transcription of exogenous nucleic acid(s) productls). As used herein, the term "promoter" refers to a specific nucleotide sequence 
recognized by RNA polymerase, the enzyme that initiates RNA synthesis. The promoter sequence is the site at which 
transcription can be specifically initiated under proper conditions. When exogenous nucleic acid(s), operatively linked to a suitable 
1 0 promoter, is(are) introduced into the cells of a suitable host expression of the exogenous nucleic acid(s) is(are) controlled in many, 
but not all cases, by the presence of ligands, which are not normally present in the host cells. 

Promoters contemplated for control of expression of exogenous nucleic acids employed in the practice of the present 
invention include inducible (e.g., minimal CMV promoter, minimal TK promoter, modified MMLV LTR), constitutive (e.g., chicken p* 
actin promoter, MMLV LTR (non-modified), DHFR), and/or tissue specific promoters. 

1 5 Inducible promoters contemplated for use in the practice of the present invention con^irise transcription regulatory 

regions that function maximally to promote transcription of mRNA under inducing conditions. Examples of suitable inducible 
promoters include DNA sequences con^esponding to: the £ co/iiac operator responsive to IPTG (see Nakamura et aL Celi 
1 8:1 1 09- 1 1 1 7, 1 979); the metallothionein promoter metal-rcgulatory-elements responsive to heavy-metal (e.g., zinc) induction 
(see Evans et aL U.S. Patent No. 4,870,009), the phage T7lac promoter responsive to IPTG (see Studier et aL, Meth. ErnymoA , 

20 1 85: 60-89, 1 990; and U.S. Patent No. 4,952,496). the heat-shock promoter; the TK minimal promoter; the CMV minimal 
promoter; a synthetic promoter; and the like. 

Exeniplarv constitutive promoters contemplated for use in the practice of the present invention include the CMV 
promoter, the SV40 promoter, the DHFR promoter, the mouse mammary tumor virus (MMTV) steroid-inducible promoter, Moloney 
murine leukemia virus (MMLV) promoter, elongation factor la (EFIa) promoter, albumin promoter, APO A1 promoter, cyclic AI^P 
25 dependent kinase 11 (CaMKII) promoter, keratin promoter, CD3 promoter, immunoglobulin light or heavy chain promoters, 

neurof iliment promoter, neuron specific enolase promoter, L7 promoter, CD2 promoter, myosin light chain kinase promoter, HO X 
gene promoter, thymidine kinase (TK) promoter, RNA Pol II promoter, MYOD promoter, MYF5 promoter, phosphoglycerokinase 
(PGK) promoter, Stf 1 promoter. Low Density Lipoprotein (LDL) promoter, chicken b actin promoter (used in conjunction with 
ecdysone response element), and the like. 

30 As readily understood by those of skill in the art, the tenn "tissue specific" refers to the substantially exclusive initiation 

of transcription in the tissue from which a particular promoter that drives expression of a given gene is derived (e.g., expressed 
only in T-cells, endothelial cells, smooth muscle cells, and the like). Exemplary tissue specific promoters contemplated for use in 
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the practice of the present invention include the GH promoter, the NSE promoter, the GFAP promoter, neurotransmitter promoters 
(e.g.. tyrosine hydroxylase, TH. choline acetyltransferase, ChAT, and the likeh promoters for neurotropic factors (B.g., a nerve 
growth factor promoter, NT-S, BDNF promoters, and the like), and so on. 

As used herein, when referring to nucleic acids, the phrase "exogenous to said mammalian host" or simply "exogenous" 
refers to nucleic acids not naturally found at levels sufficient to provide a function in the particular cell where transcription is 
desired. For example, exogenous nucleic acids can be either natural or synthetic nucleic acids, which are introduced into the host 
in the form of DNA or RN A. The nucleic acids of interest can be introduced into target cells (for in vitro applications), or the 
nucleic acids of interest can be introduced directly or indirectly into a host, for example, by the transfer of transformed cells into a 
host. 

In contrast to exogenous nucleic acids, the phrase ''endogenous nucleic acids'* or "endogenous genes' refers to nucleic 
acids naturally found at levels sufficient to provide a function in the particular cell where transcription is desired. 

Exogenous nucleic acids contemplated for use in the practice of the present invention include wild type and/or 
therapeutic nucleic acids. "Wild type" genes are those that are native to cells of a particular type. Exemplary wild type nucleic 
acids are genes which encode products the substantial absence of which leads to the occurrence of a non-nomial state in a host; 
or a substantial excess of which leads to the occurrence of a non-normal state in a host. 

Such genes may not be expressed in biologically significant levels or may be undesirably overexpressed. Thus, for 
example, while a synthetic or natural gene coding for human insulin would be exogenous genetic material to a yeast cell (since 
yeast cells do not naturally contain insulin genes), a human insulin gene inserted into a human skin fibroblast celt would he a wild 
type gene with respect to the fibroblast since human skin fibroblasts contain genetic material encoding human insulin, although 
human skin fibroblasts do not express human insulin in biologically significant levels. 

Therapeutic nucleic acids contemplated for use in the practice of the present invention include those which encode 
products which are toxic to the cells in which they are expressed; or encode products which impart a beneficial property to a 
host; or those which transcribe nucleic acids which modulate transcription and/or translation of endogenous genes. 

As employed herein, the phrase ''therapeutic nucleic acids" refers to nucleic acids that impart a beneficial function to 
the host in which such nucleic acids are transcribed. Therapeutic nucleic acids are those that are not naturally found in host cells. 
For example, synthetic or natural nucleic acids coding for wild type human insulin would be therapeutic when inserted into a skin 
fibroblast cell so as to be expressed in a human host, where the human host is not otherwise capable of expressing functionally 
active human insulin in biologically significant levels. Further examples of therapeutic nucleic acids include nucleic acids that 
transcribe antisense constructs used to suppress the expression of endogenous genes. Such antisense transcripts bind 
endogenous nucleic acid (mRNA or DNA) and effectively cancel out the expression of the gene. In accordance with the methods 
described herein, therapeutic nucleic acids are expressed at a level that provides a therapeutically effective amount of the 
corresponding therapeutic protein. 
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Exogenous nucleic acids useful in the practice of the present invention include genes that encode biologically active 
proteins of interest, such as, e.g., secretory proteins that can be released from said cell; enzymes that can metabolize a toxic 
substance to produce a non-toxic substance, or that nietaboiize an inactive substance to produce a useful substance; regulatory 
proteins; ceil surface receptors; and the like. Useful genes include genes that encode blood clotting factors, such as human 
5 factors VIII and IX; genes that encode hormones, such as insulin, parathyroid hormone, luteinizing hormone releasing factor 
(LtiRH), alpha and beta seminal inhibins, and human growth hormone; genes that encode proteins, such as enzymes, the absence 
of which leads to the occurrence of an abnormal state: genes encoding cytokines or lymphokines such as interferons, granulocytic 
macrophage colony stimulating factor (Gft/I-CSF), colony stimulating factor-1 (CSF1), tumor neaosis factor (TNF), and 
erythropoietin (EPO); genes encoding inhibitor substances such as alphat-antitrypsin; genes encoding substances that function as 
1 0 dmgs, e.g,. genes encoding the diphtheria and cholera toxins; and the tike. 

Additional nucleic acids contemplated for use in accordance with the present invention include genes that encode 
proteins present in dopaminergic neurons (useful, for example, for the treatment of Parkinson's disease), cholinergic neurons 
(useful, for example, for the treatment of Alzheimer's disease), hippocampai pyramidal neurons (also useful for the treatment of 
Alzheimer's disease), norepinephrine neurons {useful, for example, for the treatment of epilepsy), spinal neurons (useful, for 
1 5 example, for the treatment of spinal injury), glutamatergic neurons (useful, for example, for the treatnnent of schizophrenia), 
cortical neurons (useful, for example, for the treatment of stroke and brain injury), motor and sensory neurons (useful, for 
example, for the treatment of amyotrophic lateral sclerosis), and the like. 

Typically, nucleic acid sequence information for proteins encoded by exogenous nucleic acid(s) contemplated for use 
employed herein can be located in one of many public access databases, e.g., GENBANK, EMBL, Swiss-Prot, and PIR, or in related 

20 journal publications. Thus, those of skill In the art have access to sequence inf onnation for virtually all known genes. Those of 
skill in the art can obtain the corresponding nucleic acid molecule directly from a public depository or from the institution that 
published the sequence. Optionally, once the nucleic acid sequence encoding a desired protein has been ascertained, the skilled 
artisan can employ routine methods, eg., polymerase chain reaction (PCB) amplification, to isolate the desired nucleic acid 
molecule from the appropriate nucleic acid library. Thus, all known nucleic acids encoding proteins of interest are available for 

2 5 use in the methods and products described herein. 

Additional components that can optionally be incorporated into the invention constructs include selectable markers ar)d 
genes encoding proteins required for retroviral packaging, e.g., the gene, the gene, the gene, and the like. 

Selectable markers contemplated for use in the practice of the present invention include antibiotic resistance genes, 
genes that enable celts to process metabolic intermediaries, and the like. Exemplary antibiotic resistance genes include genes 
30 which impart tetracycline resistance, genes that impart ampicillin resistance, neomycin resistance, hygromycin resistance, 
puromycin resistance, and the tike. 
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Genes that enable cells to process metabolic intermediaries include genes which permit cells to incorporate L-histidinoL 
genes encoding thymidine kinase, genes encoding xanthine-guanine phosphoribosyl transferase (gpth genes encoding dihydrof olate 
reductase, genes encoding asparagine synthetase, and the like. 

As employed herein, the terms "subject organism" and "host" refer to the cell, tissue, organ or organism in need of 
transcriptional regulation of exogenous or endogenous nucleic acids. The subject organism can be mammalian or mammalian- 
derived cells or tissue. Exemplary mammals include: humans: domesticated animals. e.g.. rat. mouse, rabbit, canine, feline, and 
the like; farm animals. e.g.. chicken, bovine, ovine, porcine, and the like; animals of zoological interest. e.g.. monkey, baboon, and 
the like, or a cell thereof. Alternatively, a subject organism can be a non-mammalian, preferably non-insect, such as a plant, 
fungus or other non-mammalian species, or a cell of such a non-mammalian species. 

As employed herein, the term "ligand" (or ligand precursor) refers to a steroidal or non steroidal substance or compound 
which, in its native state (or after conversion to its "active" form), binds to at least one of the protein units, or to the invention 
chimeric protein, thereby creating a ligand/functional entity complex, which in turn can bind an appropriate response element and 
activate transcription therefrom. Ligands function to modulate transcription of nucleic acid(s) maintained under the control of a 
response element. Such ligands are well known in the art. 

In accordance with one aspect of the present invention, unless and until a suitable ligand is administered to the host, 
substantially no transcription of the desired exogenous nucleic acids occurs. Since ecdysterotds. for example, are not naturally 
present in mammalian, plant and fungal systems, and the like, if it is desired that transcription of a particular exogenous nucleic 
acid be under precise control of the practitioner, a chimeric protein containing an ecdysone receptor as one of the protein units 
and a suitable dimer partner therefore is used and the exogenous nucleic acid is put under the control of an ecdysone response 
element. i.e. a response element to which an activated ecdysone receptor binds in nature. 

The terms "ecdysone" and "ecdysteroid" as interchangeably used herein, are employed in the generic sense {in 
accordance with common usage in the art), referring to a family of ligands with the appropriate binding and transactivation 
activity (see. for example. Cherbas et al.. in Biosynthesis, metatolism and mode of action of invertetrate itormom (Ed. J. 
Hoffmann and M. Porchet). Springer-Veriag. Beriin. p 305-322. An ecdysone. therefore, is a compound which acts to modulate 
gene transcription for a gene maintained under the control of an ecdysone response element. 

20 Hydroxy-ecdysone (also known as P-ecdysone) is the major naturally occurring ecdysone. Unsubstituted ecdysone 
(also known as a-ecdysone) is converted in peripheral tissues to P-ecdysone. Analogs of the naturally occurring ecdysones are 
also contemplated within the scope of the present invention. Examples of such analogs, commonly referred to as ecdysteroids^ 
include ponasterone A. 26 iodoponasterone A. muristerone A, inokosterone. 26 mesyiinokosterone. and the like. Since it has been 
previously reported that the above-described ecdysones are neither toxic, teratogenic, nor known to affect mammalian 
physiology, they are ideal candidates for use as inducers in cultured cells and transgenic mammals according to the invention 
methods. 
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Other phytoecdysteroids are also contemplated for use in the practice of the invention as ligands of receptors which 
recognizes ecdysone response elements. Such phytoecdysteroids are known in the art U.H. Adier et al.. Lipids 30(3):257*62, 
1995). The biological effect of phytoecdysteroids in higher animals are also known (V.N. Syrov, Eksp. Klin, farmakol. 57(5):6 1 -6, 
1994). 

Non steroidal iigands are also contemplated for use in the practice of the present invention as ligands of ecdysone 
response elements. For example, when a ligand not nomially present in the cells of the host to be treated is desired (i.e., a iigand 
exogenous to the host), a hydrazine can be employed as the ligand, preferably a diacyl hydrazine. Such hydrazines include 
compounds that are readily available and/or are relatively inexpensive to manufacture. One such compound, tebufenozide, is a 
non-steroidal ecdysone agonist which is commercially available. This compound specifically targets lepidopteran species, 
including BombyxmorL Tebufenozide has undergone extensive testing in animal hosts and has proved to be of very low toxicity 
to mammals and other non-insect species. 

Additional exemplary hydrazines contemplated for use herein include 1,2*diacyl hydrazines (e.g., tebufenozide), 
N' substituted-N,N' di$ubstituted hydrazines, dibenzoylalkyi cyanohydrazines, N-substituted N-alkyl-N,N'diaroyl hydrazines, N- 
substituted-N-acyl-N-alkyls, carbonyl hydrazines, N-aroyl-N'-alkyl-N'-aroyI hydrazines, and the like. Since it has been previously 
reported that the above-described diacyl hydrazines are neither toxic, teratogenic, nor known to affect mammalian physiology, 
they are ideal candidates for use as exogenic ligands {e.g. as inducers) in cultured cells and transgenic mammals according to 
invention methods. 

Ligands, and formulations containing them, administered in a manner compatible with the route of administration, the 
dosage f onnulation, and in a therapeutically effective amount. The required dosage will vary with the particular treatment 
desired, the degree and duration of therapeutic effect desired, the judgment of the practitioner, as well as properties peculiar to 
each individual. Moreover, suitable dosage ranges for systemic application depend on the route of administration. It is 
anticipated that dosages between about 10 micrograms and about 1 milligram per kilogram of body weight per day will be used 
for therapeutic treatment. 

An effective amount of ligand contemplated for use in the practice of the present invention is the amount of ligand (e.g., 
diacyl hydrazine(sl) required to achieve the desired level of transcription andior translation of exogenous nucleic add. A 
therapeutically effective amount is typically an amount of ligand or ligand precursor that, when administered in a physiological! y 
acceptable composition, is sufficient to achieve a plasma concentration of the transcribed or expressed nucleic acid product from 
about 0.1 mg/ml to about 1 00 mg/ml, for example, from about 1 .0 mg/ml to about 50 mg/ml, and preferably at least about 
2 mg/ml and usually 5 to 1 0 mg/ml. 

Ligand can be administered in a variety of ways, as are well-known in the art, i.e., by any means that produces contact 
between ligand and receptor peptide. For example, such ligands can be administered topically, orally, intravenously, 
intraperitoneally, intravascularly, and the like. The administration can be by any conventional means available for use in 
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conjunction with pharmaceuticals, e.g., by intravenous injectioa either as individual therapeutically active ingredients or in a 
combination with other therapeutically active ingredients. Ugands contemplated for use in the practice of the present invention 
can be administered alone, but are generally administered with a pharmaceutical carrier selected on the basis of the chosen route 
of administration and standard pharmaceutical practice. 

5 In accordance with a particular embodiment of the present invention, pharmaceutically acceptable formulations, and 

kits thereof, comprising at least one ligand for an invention functional dimer, for example an ecdysteroid, and a pharmaceutically 
acceptable carrier are contemplated. In accordance with another aspect of the present invention, pharmaceutically acceptable 
fonnulations consisting essentially of at least one Itgand and a phamriaceutically acceptable carrier, are contemplated. 
Phamiaceutical fonnulations of the present invention can be used in the forni of a solid, a solution, an emulsion, a dispersion, a 
10 micelle, a liposome, and the like, wherein the resulting fonmilation contains one or more of the ligands of the present invention, as 
an active ingredient, in admixture with an organic or inorganic carrier or excipient suitable for enteral or parenteral applications. 

The tigand(s) may be compounded, for example, with the usual non toxic, pharmaceutically acceptable carriers suitable 
for administration by oral, topical, nasal, transdermal, intravenous, subcutaneous, intramuscular, intracutaneous, intraperitoneal, 
intravascular, and the like means. Administration in the fonn of creams, lotions, tablets, dispersible powders, granules, syrups, 

1 5 elixirs, sterile aqueous or non*aqueous solutions, suspensions or emulsions, and the like, is contemplated. Exemplary 
phannaceuticaliy acceptable carriers include carriers for tablets, pellets, capsules, suppositories, solutions, emulsions, 
suspensions, and any other fomi suitable for use. Such earners which can be used include glucose, lactose, gum acacia, gelatin, 
mannitol, starch paste, magnesium trisilicate, talc, com starch, keratin, colloidal silica, potato starch, urea, medium chain length 
triglycerides, dextrans, and other carriers suitable for use in manufacturing preparations, in solid, semisolid, or liquid f omn. In 

20 addition auxiliary, stabilizing, thickening and coloring agents and/or perfumes may be used. The active compound {e.g., 

ecdysteroid as described herein) is included in the pharmaceutically acceptable formulation in an amount sufficient to produce the 
desired effect upon the process or condition of diseases. 

Phannaceuticaliy acceptable formulations containing iigand(s) as active ingredient may be in a form suitable for oral 
use, for example, as aqueous or oily suspensions, syrups or elixirs, tablets, troches, lozenges, dispersible powders or granules, 
25 emulsions, or hard or soft capsules. For the preparation of oral liquids, suitable carriers include emulsions, solutions, suspensions, 
symps, and the like, optionally containing additives such as wetting agents, emulsifying and suspending agents, dispersing agents, 
sweetening, flavoring, coloring, preserving and perfuming agents, and the like. Fonnulations intended for oral use may be 
prepared according to any method known to the art for the manufacture of pharmaceutically acceptable formulations. 

Tablets containing ligand(s) as active ingredient in admixture with non-toxic phannaceuticaliy acceptable excipients may 
30 also be manufactured by known methods. The excipients used may be, for example, (1 ) inert diluents such as calcium carbonate, 
lactose, calcium phosphate or sodium phosphate; (2) granulating and disintegrating agents such as corn starch, potato starch or 
alginic acid; (3) binding agents such as gum tragacanth, corn starch, gelatin or acacia, and (4) lubricating agents such as 
magnesium stearate, stearic acid or talc. The tablets may be uncoated or they may be coated by known techniques to delay 
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disintegration and absorption in the gastrointestinai tract and thereby provide a sustained action over a longer period. For 
example, a time delay material such as glyceryl monostearate or glyceryl distearate may be employed. They may also be coated 
by the techniques described in the U.S. Pat. Nos. 4,256,108; 4J60,45Z* and 4,265,874, to form osmotic therapeutic tablets for 
controlled release. 

5 In some cases, formulations for oral use may be in the form of hard gelatin capsules wherein the ligand is mixed with an 

inert solid diluent, for example, calcium carbonate, calcium phosphate or kaolin. They may also be in the form of soft gelatin 
capsules wherein the figand is mixed with water or an oil medium, for example, peanut oil, liquid paraffin, or olive oil. 

The pharmaceutically acceptable formulations may be in the form of a sterile injectable suspension. Suitable carriers 
include non-toxic parenterally-acceptable sterile aqueous or non-aqueous solutions, suspensions, or emulsions. This suspension 

10 may be formulated according to known methods using suitable dispersing or wetting agents and suspending agents. They can 
also be manufactured in the form of sterile water, or some other sterile injectable medium immediately before use. Sterile, fixed 
oils are conventionally employed as a solvent or suspending medium. For this purpose any bland fixed oil may be employed 
including symhetic mono- or diglycerides, fatty acids (including oleic acid), naturally occurring vegetable oils tike sesame oil, 
coconut oil, peanut oil, cottonseed oil, etc., or synthetic fatty vehicles like ethyl oleate or the like. They may. be sterilized, for 

1 5 exaniple, by filtration through a bacteria-retaining filter, by incorporating sterilizing agents into the formulations, by irradiating the 
formulations, or by heating the formulations. Sterile injectable suspensions may also contain adjuvants such as presenring, 
wetting, emulsifying, and dispersing agents* Buffers, preservatives, antioxidants, and the like can be incorporated as required. 

Compounds contemplated for use in the practice of the present invention may also be administered in the form of 
suppositories for rectal administration of the dnjg. These formulations may be prepared by mixing the dmg with a suitable 
20 non irritating excipient, such as cocoa butter, synthetic glyceride esters of polyethylene glycols, which are solid at ordinary 
temperatures, but liquefy and/or dissolve in the rectal cavity to release the dmg. 

Pharmaceutically acceptable formulations containing suitable ligandls) are preferably administered intravenously, as by 
injection of a unit dose, for example. The term "unit dose," when used in reference to a pharmaceutically acceptable formulation 
of the present invention, refers to a quantity of the pharmaceutical formulation suitable as unitary dosage for the subject, each 
25 unit containing a predetemiined quantity of active material calculated to produce the desired therapeutic effect in association 
with the required diluent, i.e., carrier, or vehicle. It may be particulariy advantageous to administer such formulations in depot or 
long-lasting form as discussed hereinafter. 

Therapeutic compositions or pharmaceutically acceptable formulations containing suitable ligand are preferably 
administered intravenously, as by injection of a unit dose, for example. The term "unit dose," when used in reference to a 
30 therapeutic composition of the present invention, refers to a quantity of ligand suitable as unitary dosage for the subject, each 
unit containing a predetermined quantity of active material calculated to produce the desired therapeutic effect in association 
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with the required diluent, i.e.. carrier, or vehicle. It may be particularly advantageous to administer such compounds in depot or 
long-lasting fomt 

Suitable regimes for initial administration and booster shots are variable, but are typified by an initial administration 
followed by repeated doses at one or more intervals, by a subsequent injection, or other administration. Alternatively, continuous 
intravenous infusion sufficient to maintain concentrations in the blood in the ranges specified for in vivo therapies are 
contemplated. 

In accordance with another embodiment of the present invention, there are provided methods for producing transgenic 
animals capable of prolonged and regulated expression of exogenous nucleic acidls). said method comprising introducing into early- 
stage embryos or stem cells of the animal: 

(i) a nucleic acid construct comprising a promoter and said exogenous nucleic acid(s) under the control of a 
regulatory element; and 

(ii) nucleic acid encoding an invention chimeric protein wherein the chimeric protein activates the regulatory 
element in the presence of a ligand for the functional dimer or represses the regulatory element independently of the 
presence of said ligand. 

As used herein, the phrase "transgenic animar refers to an animal that contains one or more expression constructs 
containing one or more exogenous nucleic acidls) under the transcription control of an operator andfor hormone response element 
as described herein. 

Methods of making transgenic animals using a particular nucleic acid construct are well-known in the art. When 
preparing invention transgenic animals, it is presently preferred that two transgenic lines are generated. The first line will 
express, for example, a chimeric protein as described above le.g.. VBEcR). Tissue specificity is conferred by the selection of. a 
tissue-specific promoter (e.g.. T-cell specific) that will direct expression of the chimeric protein to appropriate tissue. A smnd 
line contains a nucleic acid construct comprising a promoter and exogenous nucleic acid under the control of a response element, 
for example, an endogenous response element. Cross breeding of these two lines will provide a transgenic animal that expresses 
an invention chimeric protein and the exogenous nucleic acid. 

In a presently preferred embodiment, an invention transgenic animal contains one or more expression constnicts 
containing nucleic acid encoding an invention chimeric protein and exogenous nucleic acid under the transcription control of a 
response element. Thus, with tissue specific expression of the chimeric protein as described above and timely ligand treatment, 
gene expression can be induced or repressed with spatial dosage, and/or temporal specificity. 

In accordance with yet another embodiment of the present invention, there are provided methods for modulating the 
transcription of an exogenous nucleic acid in a host containing: 
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(i) a nucleic acid construct comprising a promoter and said exogenous nucleic acid(s) under the control of a 
response element; and 

(ii) nucleic acid under the control of an inducible promoter, said nucleic acid encoding an invention chimeric 
protein wherein the functional entity fomied by the invention chimeric protein activates or represses the response 

5 element in the presence of a ligand for the entity; and 

said method comprising introducing a ligand not normally present in the cells of the host and subjecting the 
host to conditions suitable to induce or repress expression of the invention chimeric protein. 

In accordance with yet another embodiment of the present invention, there are provided methods for the expression of 
recombinant products detrimental to a subject organism, said method comprising: 
1 0 transforming suitable cells in the organism virith: 

(i) a nucleic acid construct comprising a promoter and exogenous nucleic acidls) which express the recombinant 
product under the control of a regulatory element that is not normally present in the cells of said organism, and 

(ii) nucleic acid encoding an invention chimeric protein 

wherein the functional entity formed by the invention chimeric protein activates the regulatory element in the 
I S presence of a ligand for the functional entity; 

growing said cells to the desired level in the substantial absence of the ligand; and 
inducing expression of said recombinant product by administering to the organism a ligand, which, in 
combination with said entity, binds to said regulatory element and activates transcription therefrom 

Recombinant products detrimental to a host organism comemptated for expression in accordance with the present 
20 invention include any gene product that functions to confer a toxic effect on the organism. For example, inducible expression of a 
toxin such as the diphtheria toxin would allow for specific ablation of tissue (Ross et at. Genes and DGvelopment 7:1318-1324 
(1993)), for example to create a new phenotype in the transgenic animaL Moreover, the numerous gene products that are known 
to induce apoptosis in cells expressing such products are contemplated for use herein (see, e.g, Apoptosis, The Molecular Ba^Js of 
Cell Death, Current Communications In Cell & Molecular Biology, Cold Spring Harbor Laboratory Press, 1991). 

2^ In accordance with still another embodiment of the present invention, there are provided methods for modulating the 

transcription of nucleic acidis) in an in vitro cellular system, wherein the method comprises administering to the cellular system an 
amount of ligand effective to modulate the transcription of the nucleic acid[s); wherein the ligand is not normally present in the 
cellular system and wherein the system comprises: 

(t) a nucleic acid constnict comprising a promoter and the nucleic acid(s) under the control of a response 
30 element; and 
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fii) nucleic acid encoding an invention chimeric protein, 

wherein the functional entity formed by the invention chinieric protein activates or represses the regulatory 
element in the presence of a ligand for the ligand binding domain. 

in accordance with yet another embodiment of the present invention, there are provided methods for the treatment of a 
host in need of gene therapy, said method comprising: 

introducing into cells of said host: 

(i) a nucleic acid constmct comprising a promoter and the exogenous nucleic 3cid(s) under the control of a 
response element 

(ii) nucleic acid under the control of an inducible promoter, said nucleic acid encoding an invention chimeric 
protein, 

wherein the functional dimer fonmed by the invention chtmeric protein activates or represses the regulatory 
element in the presence of a ligand for the functional dimer^ and 

administering, to said host, an effective amount of ligand for the invention functional dimer. 

Optionally, the cells can be obtained from the host, modified as above, and then reintroduced into the host organism. 
For example, the exogenous nucleic acid can be introduced directly into cells obtained from a donor (host or separate donor) and 
the modified ceils are then implanted within the host organism. In a presently preferred embodiment, the transplanted celts are 
autologous with respect to the host. "Autologous" means that the donor and recipient of the cells are one and the same. 

Cells can be modified by "in vivo delivery" of biological materials by such routes of administration as oral, intravenous, 
subcutaneous, intraperitoneal intrathecal intramuscular, intracranial inhalational topical, transdermal suppository (rectal), 
pessary (vaginal), and the like. The exogenous nucleic acid may be stably incorporated into cells or may be transiently expressed 
using methods known in the art. 

Modified cells are cultivated under growth conditions (as opposed to protein expression conditions) until a desired 
density is achieved. Stably transfected mammalian celts may be prepared by transfecting cells with an expression vector having a 
selectable marker gene (such as, for example, the gene for thynvdine kinase, dihydrofolate reductase, neomycin resistance, and 
the like), and growing the transfected cells under conditions selective for cells expressing the marker gene. To prepare transient 
transf octants, mammalian cells are transfected with a reporter gene (such as the £. r^// 6-galactosidase gene) to monitor 
transfection efficiency. Selectable marker genes are typically not included in the transient transf ecttons because the 
transfectants are typically not grown under selective conditions, and are usually analyzed within a few days after transfection. 

The concept of gene replacement therapy for humans involves the introduction of functionally active "wild type" or 

"therapeutic" nucleic acids into the somatic ceils of an affected host to correct a gene defect or deficiency. However, in order for 

gene replacement therapy to be effective, it must be possible to control the time and location at which gene expression occurs. 
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Genes that encode useful "gene therapy" proteins that are not normally transported outside the cell can be used in the 
invention if such genes are "functionally appended" to, or operatively associated with, a signal sequence that can "transport" the 
encoded product across the eel! membrane. A variety of such signal sequences are known and can be used by those skilled in the 
art without undue experimentation. 

5 Gene transfer vectors (also referred to as "expression vectors") contemplated for use herein are recombinant nucleic 

acid molecules that are used to transport nucleic acid into host cells for expression and/or replication thereof. Expression vectors 
may be either circular or linear, and are capable of incorporating a variety of nucleic acid constructs therein. Expression vectors 
typically come in the form of a plasmid that, upon introduction into an appropriate host cell results in expression of the inserted 
nucleic acid. 

10 Suitable expression vectors for use herein are well known to those of skill in the art and include recombinant ONA or 

RNA constructlsl. such as plasmids, phage, recombinant virus or other vectors that, upon introduction into an appropriate host 
celt, resultls) in expression of the inserted DMA. Appropriate expression vectors are welt known to those of skill in the art and 
include those that are replicable in eukaryotic cells and/or prokaryotic ceils and those that remain episomal or those which 
integrate into the host cell genome. Expression vectors typically further contain other functionally important nucleic acid 

1 5 sequences encoding antibiotic resistance proteins, and the like. 

The amount of exogenous nucleic acid introduced into a host organism, cell or cellular system can be varied by those of 
skill in the art. For example, when a vtrai vector is ^ployed to achieve gene transfer, the amount of nucleic acid introduced can 
be varied by varying the amount of plaque {orming units (PFlf) of the viral vector. 

As used herein, the phrase "transcription regulatory region" refers to that ponion of a nucleic acid or gene constmct 
20 that controls the initiation of mRNA transcription. Regulatory regions contentplated for use herein, in the absence of the non- 
mammalian transactivator, typically comprise at least a minimal promoter in combination with a regulatory element responsive to 
the ligand/receptor peptide complex. A minimal promoter, when combined with a regulatory element, functions to initiate mFt IMA 
transcription in response to a iigand/functional dimer complex. However, transcription will not occur unless the required inducer 
(ligand therefor) is present. However, as described herein certain of the invention chimeric protein heterodimers activate or 
2 5 repress mRNA transcription even in the absence of ligand for the DNA binding domaia 

As used herein, the phrase "operatively associated with" refers to the functional relationship of DNA with regulatory 
and effector sequences of nucleotides, such as promoters, enhancers, transcriptional and transtationai stop sites, and other signal 
sequences. For example, operative linkage of DNA to a promoter refers to the physical and functional relationship between the 
DNA and promoter such that the transcription of such DNA is initiated from the promoter by an RNA polymerase that specifically 
30 recognizes, binds to and transcribes the DNA. 
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Preferably, the transcription regulatory region further comprises a binding site for ubiquitous transcription factor(s). 
Such binding sites are preferably positioned between the promoter and the regulatory element. Suitable ubiquitous transcription 
factors for use herein are well-known in the art and include, for example. Spl. 

Exemplary eukaryotic expression vectors include eukaryotic constructs, such as the pSV-2 gpt system (Mulligan et aL , 
5 (19791 Nature, 277:108-1 14); pBtueSkript (Stratagene, La Jolla, CAl the expression cloning vector described by Genetics 
Institute [Science, (1 985) 228:B10-815h and the like. Each of these plasmid vectors is capable of promoting expression of the 
chimeric protein of interest. 

Suitable means for introducing (transducing) expression vectors containing invention nucleic acid constnjcts into host 
cells to produce transduced recombinant celts (i.e., ceils containing recombinant heterologous nucleic acid) are well-known in the 

10 art (see, for review, Friedmann, Science, 244:1275*1281, 1989; Mulligan, Science, 260:926 932. 1993, each of which are 
incorporated herein by reference in their entirety). Exemplary methods of transduction include, e.g., infection employing viral 
vectors (see, e.g., U.S. Patent 4,405,712 and 4,650,764), calcium phosphate transfection (U.S. Patents 4,399,216 and 
4,634,665), dextran sulfate transfection, electroporation, lipofectlon (see, e.g., U.S. Patents 4,394,448 and 4,619,794), 
cytofection, particle bead bombardmeni and the tike. The transduced nucleic acid can optionally include sequences which allow 

1 5 for its extrachromosomal (i.e., episomal) maintenance, or the transduced nucleic add can be donor nucleic acid that integrates into 
the genome of the host 

In a specific embodiment, a gene transfer vector contemplated for use herein is a viral vector, such as Adenovirus, 
adeno associated virus, a herpes-simplex vims based vector, a synthetic vector for gene therapy, and the like (see, e.g., Suhr et 
aL Arch, ofNemoL 50:1252-1268, 1993), Preferably, a gene transfer vector employed herein is a retroviral vector. Retroviral 
20 vectors contemplated for use herein are gene transfer plasmids that have an expression construct contairung an exogenous 

nucleic acid residing between two retroviral LTRs. Retroviral vectors typically contain appropriate packaging signals that enable 
the retroviral vector, or RNA transcribed using the retroviral vector as a template, to be packaged into a viral virion in an 
appropriate packaging cell line (see. e.g., U.S. Patent 4,650,764). 

Suitable retroviral vectors for use herein are described, for example, in U.S. Patents 5,399,346 and 5,252,479; and in 
25 WlPO publications WO 92/07573, WO 90/06997. WO 89/05345, WO 92/05266 and WO 92/14829, each of which is hereby 
incorporated herein by reference, in its entirety. These documents provide a description of methods for efficiently introducing 
nucleic acids into human cells using such retroviral vectors. Other retroviral vectors include, for example, mouse mammary tumor 
virus vectors (e.g., Shackleford etal, (1988) PNAS, USA, 85:9655-9659), human innmunodeficiency virus (e.g., Naldini etaf, 
(1 996) Science 212: 1 65-3201, and the like. 

30 Various procedures are also well-known in the art for providing helper cells which produce retroviral vector particles 

that are essentially free of replicating virus. See, for example, U.S. Patent 4,650,764; Miller. Human Gene Therapy, 1:5-14, 
1990; Markowitz, et al.. Journal of Virology, £li4]:l 120-1 124, 1988: Watanabe. et aL Molecular and Cellular Biology, 
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3112):2241-2249, 1983; Danes, etaL, PNAS, 55:6460-6464, 1988; and Bosseimaa etaL Molecular and Cellular Biology, 
2(5): 1797- 1806. 1987, which disclose procedures for producing viral vectors and helper ceils that minirme the chances for 
producing a viral vector that includes a replicating virus. 

Recombinant retroviruses suitable for carrying out the invention methods are produced employing well-known methods 
5 for producing retroviral virions. See, for example, U.S. Patent 4,850,764; Miller, supra 1 990; Marko wit2, et aL supra 1 988; 
Watanabe, etaL supra 1983; Danes, etaL PNAS, 85:6460-6464, 1988; and Bosseimaa et aL Molecular and Cellular Biology, 
2151:1797.1806, 1987. 

For example, in one embodiment, a modular assembly retroviral vector (MARV) can be utilized to express the invention 
chimeric protein and an antibiotic resistance gene. A "covector** (referred to herein as MARSHA) can be utilized to provide a 
1 0 nucleic acid construct comprising the promoter, the regulatory element and exogenous nucleic acid, and a second antibiotic 
resistance gene. The MARSHA vector carrying exogenous nucleic acid also has LTRs modified to promote high-level expression 
only in the presence of the invention chimeric protein encoded by MARV and exogenous Itgand therefor. Co-infected prinrary 
mammalian cells can then be selected using both antibiotics, resulting in a cell population that is dependent on ligand for high level 
expression of the exogenous nucleic add. 

1 5 By introducing all of the necessary regulatory machinery, plus exogenous nucleic acid, selectable markers, and nucleic 

acid encoding invention chimeric protein, e.g., into a MARV retrovirus, highly efficient insertion of exogenous nucleic acids into 
targeted cells can be achieved. 

Thus, the above-described viral constructs address several important problems confronted in the use of retroviruses in 
application of therapeutic gene transfer strategies to a variety of human diseases. For example, the retroviral vectors of the 
20 invention are capable of prolonged gene expression under conditions where conventionally integrated retroviruses are no longer 
transcriptionally active. 

To illustrate the invention chimeric protein FOs. EcR was used as the steroid/thyroid hormone nuclear receptor and 
multiple examples using either RXR or Usp as the dimer partner were constructed, with either the EcR or the dimer partner 
positioned at the amino terminus. The ON A binding, transactivation, and dimerization properties of these several FD variants 
25 were compared with the properties of native receptor complexes. The size of the ENU and ENR FOs prepared was in the 
range from about 135 to about 145 kO; whereas E alone had a size of 94 kO, as shown by Western blot analysis. 

EcR Usp and EcR RXR FOs efficiently bind EcREs 

The FOs were first examined for their ability to bind to target EcREs in response to ligand. FD proteins were 
extracted from transiently transfected human 293 cells that were either untreated or treated with 1 ^M murA for 40 hours. 
30 To eliminate the possibility that certain of the FD constructs were translated with greater efficiency than others, which 
would lead to a false appearance of higher functional binding in comparative luciferase expression tests, p-galactosidase 
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expression of internal control plasmids cotransfected with FD constructs was performed. These tests indicated no 
significant differences in the transfection efficiency of individual FD constructs, indicating that the ONA binding differences 
observed for different FDs reflect imermolecular properties of the FDs themselves, not expression level. Accordingly, all 
reactions were normaliied to an internal |3-galactosidase control and total protein using the anti-EcR monoclonal amibodv 
5 DDA2.7 (Koelle Bt aL supra, 1991). 

FD constructs with either 0 or 5 linker segments were assayed for their ability to bind labeled EcREs. as shown in 
Figure 2B. A prominent band co-migrating with band shifts for the separate dimer complexes (E li and E-^R) was observed 
in many lanes and indicated that some of the FOs formed functional ONA-binding internal dimers. referred to herein as 
"endodimers." For example, ROE, which is analogous to (JOE, except for the substitution of U for ft as the dimer partner, 
10 formed a clear endodimer band that was increased 4-fotd by ligand, and demonstrated unequivocally a response to hormone. 
R5E, with a longer linker, had slightly decreased basal and slightly increased ligand-stimutated. EcRE binding (5-fotdl 
compared to that of ROE. 

Constructs in which E is positioned at the N termtnus form endodimers and bind the EcRE probe an average of 1 0 
times better than UNE constructs (Figure 2A). In addition, ENU constmcts bind probe 80*150% more readily than even 
15 those other FDs witfi high-level EcRE binding, such as E5R, but display nearly complete insensitivity to ligand for formation 
of DNA-binding complexes. This effect was found to be substantially independent of tinker length. 

ENR FOs also demonstrate a greater affinity for the EcRE probe than do RNE constructs. However, unlike EM U 
constructs, ENR FDs have a high degree of dependence upon the presence of ligand for formation of endodimers. 

The observation that ENU and ENR FDs bound probe better than either the RNE construct or the UNE construct 
20 prompted examination of whether the large 220 amino acid F domain of E (not found on R or U) accounted for this effect . 
Further EcRE probe binding studies performed utilizing several FDs having in frame incremental deletions of EOR to Nhel, 
Pvull, Narl, and Bglll sites within the ecdysone receptor F domain (Figure 2B) showed that the deletions had a minimal effect 
on either response to ligand, or on binding to the EcRE probe. Only EOR-ABglli, in which the extreme C-terminal end of the 
homfione binding domain is removed, displayed significant loss of the shifted band. These results suggest that flexibility 
25 within the long EcR F domain is not the primary determinant of improved ONA binding by FDs containing a EcR at the N 

terminal. This was presumed to be the result of a perturbed ligand binding pocket as opposed to decreased flexibility of the 
dimer partners joined by a linker. 

EcR RXR and EcR-Usp FD Transacttvation in Response to Ligand 

The results shown in Figure 2 clearly indicate that the EcR RXR and EcR Usp FD chimeric proteins could both 
30 respond to honnone and interact with the response element for EcR; however, these tests provided no indication regarding 
the function of these proteins to transactivate responsive promoters and induce gene expression. To detennine the ability of 
the FOs to transactivate responsive promoters and induce gene expression, the FDs and an EcRE-luciferase reporter piasmid 

32 



wo 01/36447 



FClVUSUU/41224 



were co transfected into 293 cells. The results of these studies revealed that the activity of the FDs was variably 
diminished compared to that of monomerrc receptors. 

It has also been discovered that the invention EMU FOs are constitutive repressors of transcription of a gene under 
the control of the corresponding steroid/thyroid hormone nuclear receptor's response element. For example. UNE constructs 
5 did not efficiently bind target EcREs (Figure 2)* and predictably did not have a dramatic influence on expression of the E4 (uc 
reporter plasmid. ENUs. on the other hand, although appearing to readily bind EcREs, were also unable to stimulate 
luciferase expression. To further confirm whether END proteins could bind target response elements, but had lost the 
capacity to transactivate, the ability of EOU to competitively block iigand-stimulated luciferase expression by monomeric 
receptors was tested. EOU elicited a dose-dependent inhibition of VE with endogenous dimer partner, whereas UOE bad 

10 virtually no inhibitory influence (Figure 5A). At the lowest ratio tested, a 1:20 ratio of EOU to VE decreased stimulated 
expression of VE by 20%. white equtmolar EOU blocked 80% of the response to ligand. At any concentration tested. UOE 
exhibited no suppressive effect on VE-mediated activation, and in fact, appeared to increase the stimulated level of 
expression by about 5% to about 15%. EOU had a similar Influence on E without VP1 6 fusion (Figure SB), and a lesser but 
measurable inhibitory effect on VE combined with separate exogenous Usp (Figure 5C|. The 50% inhibition of the VE^ U 

1 5 basal transactivation level may suggest that EOU binds the target EcRE about as well as complexes of the separate 
receptors. The plateau of the EOU suppressive effect at a 1:1 ratio with both E and E^U and the slightly increased 
stimulation at a 5:1 ratio of EOU to separate receptors indicate that, at higher concentrations, EOU can weakly transact ivate 
in response to added ligand. 

In the experiments described herein utilizing FDs containing various combinations of DEcR and BEcR with either 
20 RXR or Usp, the length of the linker was not observed to have a significant effect on any of the functions of the FDs. 

Surprisingly, even shortening of the large F domain of Drosophilia melanogaster EcR had little impact on ligand responsi ve 
DNA binding of EcR FDs. Without restriction of the scope of the invention by any theoretical speculations, two possible 
explanations of this observation are offered. One possibility is that there is enough flexibility within the structure of the 
individual receptors that any deformation necessary to allow appropriate dimerization can be tolerated while preserving, tn 
25 some cases, nearly complete activity. The second possibility is that the C-terminus of the 5' receptor naturally ties close in 
spatial proximity to the N terminus of the 3' receptor such that the distance is easily spanned. In the absence of detailed 
structural data of intact nuclear hormone receptor dimers, neither explanation can be ruled out. 

In the presence of ligand (i.e.. MurA). RXR -containing FOs functioned similarly to monomeric ECR^RXR, but at 
approximately one^half of the maximum level of transactivation (for E5R). One likely explanation for this is that the FD 
30 constructs are expressed or translated less efficiently than the monomeric receptors as suggested by comparing the 

intensity of FO lanes to the E lane in a Western blot analysis. Although the level of absolute transactivation was halved^ the 
relative induction of transcription by the ENR constructs, in particular, exceeded the relative induction of monomeric EcR and 
RXR. 
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To further address the partial or complete loss of transcriptional activation for FDs relative to the separate 
proteins, the powerful VP16 transactivating domain was coupled to the amino-termini of FDs or separate £cR to determine if 
this addition could restore lost transactivational capacity. While a linker containing 5 linker segments (65 amino acids) was 
sufficient to allow good DNA binding, it may not have allowed sufficient freedom of movement for other domains, including 
5 those responsible for transactivation, to fold or orient as they do in the native proteins. Length of the linkers in VE5R and 
VE5U was therefore increased in increments of 5 linker segments each to 10 and uhimately 20 linker segments (240 5 
amino acids). 

Addition of the VP16 activation domain to the amino terminus of the ENR FDs restored the full transactivation 
potential of the FDs relative to separate VP16*fused EcR. suggesting that addition of a strong transactivating domain 

1 0 overrides transiational deficits caused by incorporating the receptor/dimer partner into a chimeric protein. However, V P 1 6- 
EMU (VEND) chimeric proteins never exceeded 30% of the absolute level of transactivation of separate VE-f U. One possible 
explanation is that some conformation constraint in the Usp half of the VENU constructs prevented the interaction of VENU 
with endogenous cofactors necessary for full high level expression. A second possibility is that increased spontaneous 
heterodimerization within EMU FDs results in a conformational alteration that decreases or blocks ligand binding, assuming 

1 5 that tigand plays a direct role in transactivation and not just dimerization. 

High level VENR transactivation suggests that the very low level expression observed from untiganded ENfi 
constructs in Figure 3 was due to transcriptional repression by bound ENR proteins. The addition of VP16 increased the 
basal VENR FD expression over S fold in comparison to separate VECR^RXR. This presumably reflected an increased level 
of spontaneous dimer formation of ENR FDs resulting from the forced proximity of the separate components by the linker. 
20 This phenomenon may not have been readily evident in the gel shift experiments as the result of the short transient period of 
interaction of the proteins with the DNA probes ( < 30 minutes) compared to the duration with the responsive promoters in 
the transient transfection experiments (> 30 hours). 

Heterodtmer formation and DNA binding of FDs 

Five different classes of interaction of protein units in the invention chimeric protein(s) (Figure 7) were predicted: 
25 1 ) "disorganized", indicative of non-interaction by the individual receptor components; 2) "endodimer", indicating formation 
of functional dimers approximating the native heterodimer complex; 3) "Xmet", indicating interaction with a separate 
monomeric partner, 4) ''tetramer^ indicative of two mutually cross-interacting chimeric proteins ; or 5) "otigomers", 
representing chimeric proteins chain-interacting with each other. The data presented herein provides clear evidence that 
disorganized, endodimer, and trimer species were formed when the invention chimeric proteins contained two protein units 
30 (FDs), with endodimer formation predominating. Only the UNE constructs appeared to be largely disorganized, as evidenced 
by their apparent inability even to bind DNA with high affinity. All remaining FD classes showed abundant evidence of 
endodimer fomiation, although RNE constructs were noticeably weaker than constructs with EcR in the amino terminal 
position. The transiem transfection competition experiment (Figure 4) indirectly indicates that a high affinity monomeric 
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dimer partner, such as VUsp, can displace a weaker intramolecular RXR dimer partner to form a partially functional trimer 
species; however, such a monomeric, high affinity dimer partner was much less capable of displacing an intfamolecular Usp 
under the same circumstances. The lower affinity VRXR monomeric dimer partner was unable to displace to any significant 
degree either Usp or RXR as an intramolecular dimer partner in an invention FO fusion protein. 

5 Evidence of formation of higher order constructs, such as tetramers or oligomers, in gel shift assays is scar^t. but 

may be suggested in lanes of the gel shift assay with UNE and RNE constructs. Although band shifts were weak, UNE (and 
unliganded RNEI constructs had slightly intensified bands of higher molecular weight than the corresponding size of the 
endodimer bands. The faint but detectable high molecular weight shift bands observed for FOs in some lanes of the assays 
suggest tetramer and oligomer formation, presumably through cross interaction of the ecdysone receptor component of one 
1 0 FO with the dimer partner unit of another. These results, coupled with the results of competition experiments using 
superphysiological levels of competing dimer partner, suggest, in any event, that multimerization is relatively rare and is 
likely to occur, at even low levels, only with those FDs that have decreased capacity for endodimer formation. 

These data further support the supposition that proximity to dimer partner (i.e., as in invention chimeric protein(s)) 
not only limits dimer partner preference, but also increases the ease of dimer formation and ON A binding for some of the 
1 5 fusion constructs relative to monomeric receptors. For example, ENU constructs displayed high-level complex formation with 
the EcRE probe (a 1.1* and Q.9 fold increase for EU and E5U, respectively) even in the absence of iigand, while separate EcR 
and Usp required ligand for maximal complex fomtation. ENR constructs, on the other hand, still retained much of their 
original ligand dependence, indicating that dimer partner proximity is not the sole, or perhaps even most important, 
determinant of dimer formation. 

The degree to which FOs interact with external receptors to form a trimer complex was indirectly examined in the 
studies showing their interaction with high levels of competing VP16-fusion dimer partners and the resulting effect on 
transactivation of the E4 luc reporter. In these siu6tes, monomeric VRXR was unable to enter the FO complex of any 
construct, suggesting that the EcR component of FDs much prefers a linked dimer partner of either high or low affinity to a 
separate low affinity dimer partner. VUsp, on the other hand, had a comparatively smaller effect on transactivation induced 
by the E5U construct, than on E5R. indicating that the EcR protein in E5U preferentially dimerizes with the linked Usp, while 
the RXR dimer partner of E5R may enter a complex with monomeric VUsp, 

In summary, the results of the studies described herein indicate that selected chimeric proteins of steroidfthyroid 
hormone nuclear receptors with appropriate dimer partners can retain most of the primary characteristics of the native 
complex: binding of ligand, recognition and binding of cognate response elements, and, in some cases, ligand stimulated 
30 transactivation of responsive promoters. Subsets of the constmcts prepared displayed varying degrees of these 

characteristics. The R/UI\IE proteins characteristically exhibited low DMA binding and transactivation capacity while the 
ENR/U proteins uniformly demonstrated wild-type or superior EcRE binding and variable capacity to transactivate, resulting 
in properties ranging from constitutive repression to essentially wildtype, ligand responsive transactivation. 
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The invention herein provides the advantage that, for many studies in cultured cells or transgenic animals, 
invention FDs wilt allow the examination of specific heterodimer pairs with much decreased potential for contamination with 
exterior dimer partners, such as those endogenously produced in the test cell or animal. This may be of particular assistance 
in examining the function of specific RXR subtype combinations, or even for further studying the potential for Usp-Usp 
interactions. 

The invention herein provides the further advantage that specific heterodimer pairs can be examined by their 
introduction into the system as a single chimeric protein, e.g., as a fusion protein, rather than by separate introduction of 
two constructs. 

In addition, studies described in the Examples herein indicate that many of the combinations may have unique 
properties of ligand independence or repression that may have significance to their application for therapeutic purposes. For 
example, certain of the invention chimeric proteins that transactivate gene expression may be useful as a "gene switch, " for 
modulating expression of an exogenous gene in a mammalian system or in plants, fungi and other non mammaiian species. 
When the FDs transactivate the response element-containing promoter (e.g.. in the presence of ligand), the exogenous gene 
is switched on; when the FDs repress the promoter, the exogenous gene is switched off. 

In nature, dimers of nuclear hormone receptors are unstable and. hence, are not useful in x ray crystallography 
studies to determine structure. Because of the demonstrated stability of the invention FD heterodimers. they may be 
advantageously used in the preparation of crystals for x-ray diffraction studies for use in rational design of Itgands to 
develop new steroids, insecticides, steroid antagonists, and the tike, as described hereinbelow. Crystal structure may also 
permit deduction of the structure of ligands for orphan receptors. It is also contemplated that such crystals of the invention 
FDs can be used for preparation of antibodies that react with the heterodimers using methods known in the art. 

In accordance with another embodiment of the present invention, there are provided isolated protein crystals 
suitable for x ray diffraction analysis of a purified invention chimeric protein. In alternative embodiments, the crystal may be 
obtained of a ligand bound to a purified chimeric protein so as to form a chimeric protein-iigand complex, or a crystal may be 
obtained of a putative response element bound to purified fusion protein or fusion protein ligand complex as described herein. 
The invention additionally contemplates a set of x-ray diffraction crystal coordinates obtained by x ray diffraction of any 
such invention isolated protein crystals. 

A variety of methods are known in the art for purifying proteins and obtaining crystals of the purified proteins, for 
example growing crystals in microgravity andlor by vapor diffusion (O.R. Davies and D. M. Segal. Meth. Emymol, 22:266 , 
1971). Crystals of purified proteins can also be obtained commercially. To aid in the purification of the invention fusion 
proteins, it is recommended to add a His tag to the amino terminus of the fusion protein, as is known in the art and described 
in Example 6 herein. Addition of such a His tag does not interfere with dimerization of the invention fusion proteins. 
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in accordance with still another embodiment of the present invention, there are provided methods for identifying 
potential iigand(s| for member(s) of the steroid/thyroid hormone nuclear receptor superfamily utilizing a set of atomic 
coordinates obtained by x-ray diffraction analysis of an invention purified protein crystal. The invention assay method 
comprises creating a ttiree^dimensionaf structure of a chimeric protein formed into a functional entity (i.e., by dimerization of 
dimerization domains contained therein) as defined by the atomic coordinates obtained by x-ray diffraction studies, 
employing the three-dimensional stnicture to design or select the potential ligand; synthesizing the potential ligand; and then 
contacting the potential ligand with an invention functional entity in the presence of a response element operativety linked to 
a marker protein under conditions suitable for causing expression of the marker protein to determine the ability of the 
potential ligand to transactivate expression of the marker protein. The potential ligand can be designed de novo or designed 
from a ligand. 

Methods for obtaining a set of atomic coordinates of a protein crystal using x-ray diffraction and for creating a 
three-dimensional model of a protein from such a set of atomic coordinates are known in the art. Such procedures are 
disclosed, for example, in U. S. Patent No. 5,856,1 16, which is incorporated herein by reference in its entirety. For example, 
x-ray data sets can be collected on a R-axts ItC image plate system and/or on a 2.2A Synchrotron data set for refinement of 
1 5 the three-dimensional structure (i.e., the model). Then, the data can be collected at Cornell High Energy Synchrotron Source 
("CHESS") on a charge-coupie device and reduced to stnicture factor amplitudes using the Denzo Software Package (Oenzo • 
An Oscillation Data Processing Program For Macro Molecular Crystallography, ^^1993, Daniel Gewirth, Yale University). 
Oscillation photographs can be integrated and reduced to structure factor amplitudes using software supplied by the 
manufacturer (Molecular Stnictures Corp., Dallas, TexJ. 

Refined heavy atom parameters can be used to compute multiple isomorphous replacement phases. Solvent 
flattening and phase extension (CCP4-Cotl3borative Computing Project No. 4. A Suite of Programs for Protein 
Crystallography; Daresbury Laboratory, Warrington, WA4 4 AO, U.K. (1979)) can be used to improve the map and allow 
identification of some of the residues in the protein core. Cycles of model building (Quanta, version 4.0b, Molecular 
Simulations Inc., Burlington Mass.). positional refinement, (Brunger, A. T., 1 Acta Cryst., M&48-57, 1990); Brunger, A. T. 
et a)., J, Acta Cryst., MS:58S-93, 1990) and phase combination (CCP4-Collaborattve Computing Project, st/pra) can be 
carried out until the switch to phases calculated from the model can be made. Refinement against -16 ''C, 2.2.A data can 
be continued to allow the more difficult loop regions of the protein to be constructed. 

The invention will now be described in greater detail by reference to the following non-limiting examples. 

EXAMPLE 1 

30 Design of ecdysone receptor-UsplRXR functional dimers 

Two classes of chimeric proteins were constructed as fusion proteins to study the activity of EcR UspfRXR 
functional dimers. in one class, EcR is at the N-terminus of a fusion protein, and in the other class the biniling partner (either 
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lisp or RXR) is at the N-ierminus (Figure 1). To facilitate formation of the functional dimers and allow for insertion of 
polypeptide linkers between the receptor and its binding partner in the fusion protein, a 5 amino acid bridge that also 
encodes the restriction endonuclease site for Sfil was inserted between the two open reading frames (ORFsl. 

A double stranded Sfil compatible oligonucleotide encoding the amino acid sequence GPGGGSGGGSGT (SEQ ID 
5 N0:1 7) was designed to provide a high degree of predicted flexibility while attempting to minimize repetitive sequence within 
the oligonucleotide. This nucleotide sequence incorporated the Sfil site at the 5' end of the insert to allow for ease in 
increasing the number of linker segments within a previously existing construct. By phosphorytating the SG base^pair double 
stranded oligonucleotides and ligating them into Sfil digested FD plasmid templates, FDs were produced with peptide linkers 
of variable length that increased by 12 amino acid increments. 

1 0 Construction of fusion proteins containing the ecdysone receptor. 

Figure 1 shows the schematically fusion protein functional dimer constructs R/U(N)E and E(N)R1U. Construction of 
the invention fusion proteins began with modification of the N and C termini of human RXRa, dmlisp and dmEcR ORFs 
subcloned into the cloning vector SK NBN (pBSK with a modified polylinker). An Sfil site was inserted at either end of each 
receptor, in frame, by PGR mutagenesis. 

1 5 For the hRXR N terminal Sfil site, the primer in the 5' direction was 

GTAGAAnCGGCCAACAGGGCCCATGGACACCAAACATHC (SEQ 10 N0:18); and the primer in the 3' direction was 
GATGGGGGAGCTCA666TGC (SEQ 10 N0:19). 

For the C-terminal Sfil site, the primer in the 5' direction was GGAGAGCTCGAGGCCTACTGCA (SEQ ID N0:2O ); 
and the primer in the 3' direction was ACCATCGAnCAGGGCCCTGnGGCCCGTGCGGCGCCTC (SEQ ID N0:21). 

20 For the dmusp N terminal Sfil site, the primer in the 5' direction was 

GTAGAATTCGGCCAACAGGGCCCATGGACAACTGCGACCAG (SEO ID N0:22); and the primer in the 3' direction was 
CAGCACGTOGACCAHGACA (SEQ ID N0:23). 

For the C-terminal Sfil site, the primer in the S' direction was G6AGA6CTCTTTCTCGAGCAGCTG (SEQ ID NQ:24|; 
and the primer in the 3' direction was ACCATCGAnCAGGGCCCTGnGGCCCCTCCAGTTTCATCGCCA GGCCG (SEQ ID 
25 N0:25). 

For the ecdysone receptor N-terminal Sfil site, VP 16 sequences were fused in frame to the Ncol site approximately 
200 base pairs into the ecdysone receptor ORF. creating an Sfil site at the VPIB-ecdysone receptor boundary. 

For the VP 16 insertion site, the primer in the 5' direction was 
CATAAGCnATGGGACAGACACTGATGGGACGGCCC (SEQ ID N0:26) and the primer in the 3' direction was 
30 CAGAGACCATGGGCCCTGnGGCCCCCCACC (SEQ ID N0:27). 
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For the ecdysone receptor C-terminus insertion site, the primer in the 5' direction was TTACCGCTAGCTCCACCA 
(SEQ 10 N0:28); and the primer in the 3' direction was GTAGATATCAGGGCCCTGnGGCCCAGTCGTCGAGT (SEQ ID 
N0:29). All primer sequences are written 5' to 3'. 

For VP16 (S.J. Triezenberg etaL GenBS Dev., 2:718-729, 1988) fusion to RXR and Usp, the VP16 sequence region 
5 was removed from VE using the Sfil site at the 3' boundary of VP 1 6 sequence for fusion of the 260 base pair VP1 6 

fragment into the N-terminai compatible Sfil site of previously modified RXR and Usp ORFs. All fusion receptor variants 
were originally produced by insertion of both ORFs at the central Sfil site. Linker segments with Sfil compatible overhangs 
were produced by annealing two linker-encoding oligonucleotides having the sequence 
GGGCCAGGAGGTGGCTCCG66GGAGGTTCAGGCACA (SEQ ID IUQ:30) in the 5' direction, and the sequence 
10 GCCTGAACCTCCCCCGGAGCCACCTCCTGGCCCTGT (SEQ ID N0:31> in the 3' direction. 

EcR F-domain deletion constructs were produced by inserting an in*frame polylinker upstream of the Sfil N-terminal 
modified RXR for reception of compatible F-domain deleted ecdysone receptor fragments. The polylinker in the 5' direction. 
AAGCnGAGAGATCTGGGACGGCGCCCCCGGGGCTAGCGGGCCAACA (SEQ ID N0:32) encoded (from Bgl II) the peptide 
sequence IWDGAPGAS (SEQ ID N0:33) and restriction sites Hind III Bgl II Nar l-Sma I Nhe I with an Sfil compatible 3' end. 
1 5 Hind III Bgl 11, Hind lll-Nar I, etc. fragments of the ecdysone receptor were inserted into this polylinker for fusion of F domain 
deletions to RXR. Figure 2B shows schematically the F-domain deletion constructs of EOR made by this procedure. 

PCR reactions for production of receptor mutants were performed using 100 ng plasmid template, 500 ng of each 
primer, and reaction conditions outlined by the manufacturer for Pwo (Boerhinger Mannheim) high-fidelity polymerase. A 
program of 1 min. 94**C/1 min. 45*C/1 min. 72**C/1 min. for 20 cycles was used for production of alt PCR products used. 

20 For constructs containing multiple repeat linker segments, fusion receptors were linearized by Sfil digest, and linker segment 
oligonucleotides, kinased to allow multiple tandem insertions, were ligated into the site by standard methods. Inserted linker 
segment repeats of between 0 and 5 linker segments were found by restriction endonuciease digest followed by sizing on 
3% agarose gels. For the studies reported here, the minimum linker length contained only the 5 amino acid fusion bridge 
(signified herein by linker segment designation "O") and the maximum was 245 amino acids (including a five amino acid 

25 fusion bridge) (signified herein by linker segment designation "20"). 

Plasmids from clones of interest were prepared on a large scale for use in transfection and other analysis including 
confirmatory sequencing of constructs. All receptors were subcloned into vector LNCX (A.D. Miller, GenBank Acc. No. 
M28247) (with an extended polylinker) for use in transfection. 

EXAMPLE 2 

30 Transfection of FD constructs. 

For quantitative transactivation analysis, transfections were performed in triplicate in 24 well plates by calcium- 
phosphate co-precipitation with 100 ng of an individual receptor, the reporter plasmid E4 luc, and pCH1 10 (SV40 (3' 
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gaiactosidase) as an internal control. Briefly, the reporter piasniid. E4-tuc, was constructed of 4>tandem EcREs inserted 
upstream of a tfiymidine kinase gene minimal promoter directing lucif erase expression. EcRE oligonucieotides were as 
described by Thomas et aL, supra (1993) with BamHl/Bglll compatible ends. 1 ligand was added at the time of 
transfection, and the cells were harvested for luciferase assay 40 hours later. Harvested cell extracts were split, and one 
5 part was analyzed in a luminometer for luciferase activity and the other part was analyzed for [5- gaiactosidase activity using 
an orthonitrophenyl gatactoside assay by standard methods. Luciferase levels were normalized to 3-gatactosidase values to 
correct for slight differences in transfection efficiency. 

Preparation of the invention fusion protein FDs for gel shift analysis by transient transfection into 293 cells was as 
follows: 300 ng of individual receptor plasmids and 100 ng pCH1 10 internal control plasmid were cotransfected into 293 

10 cells at 60% density in Costar 6-we(t plates. One well of each group was treated with 1 ^M murA as ligand at the time of 
transfection. 40 hours later, extracts of transfected 293 cells were made by scraping the cells from a well into a low volume 
of phosphate buffered saline, pelleting the ceiis. resuspending them in 200 ^l SX gel shift buffer (Yao et aL supra 1992). 
and sonicating with a Kontes cell disrupter for three lO sec. bursts at output level 30. The extracts were then centrifuged 
to clear the lysate and the protein was quantified. Extract volumes were adjusted with buffer to a concentration of 1 mg/ml 

1 5 and frozen at -JO^'C until use. 0-gaiactosidase activity was assayed as above to determine relative transfection efficiency 
of each well. 

EXAIi/IPLE3 
Gel mobility shift analysts. 

Comparative gel mobility shift analyses of FDs with control in vitro translated receptor complexes were performed 
20 using double stranded EcRE probes and labeled by Klenow fill using ^^P-dCTP and cold dGAT by standard methods. //7 y/tro 
translated proteins used as controls were produced using the T3/T7 TNT IPromega) transcription/translation system 
following the manufacturer's protocol. //? v/tro translated proteins were qualitatively examined by 5% SDS PAGE using 
protocols as described (J. Sambrook et a/. Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Press. Cold Spring 
Harbor, New York, 1989) to ensure the presence of full-length products of the proper molecular weight. Reaction conditions 
25 for protein-probe interaction and gel electrophoresis were essentially identical to those disclosed in Yao etaL supra 1 992. 
However to improve comparison between samples, reaction mixtures (including dimer partners and probel were prepared as 
a cocktail and distributed equally to individual tubes with receptor proteins, ^ gaiactosidase assay indicated that all 
samples were essentially equivalent, so equal volumes (10 ^l) of extract were used in each reaction in a final reaction 
volume of 30 The reactions were allowed to proceed at 23"C for 5 minutes at which time ligand or vehicle was added 
30 and the reaction allowed to continue for 20 additional minutes. Band volumes were quantified using laser scanning 
densitometry. 
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Western blot analysis 

6-weii plates of 293 celts transfected with 2 of receptor construct and 100 ng of pCH1 10 were harvested in 
PBS and lysed by three rounds of freezing and tfiawing. P-galactosidase assay of lysates indicated equivalent transfection 
efficiency for all constructs so equivalent protein (7.5 ^o) was loaded and run on a 12*5% SOS^PAGE gei and transferred to 
5 nitrocellulose by standard methods ISamimk et aL supra 1989). The transferred fitter was incubated with the anti-EcR 
monoclonal antibody OOA2.7 {Koelle etaL supra 1991) at a 1/1000 dilution at 4°C for 48 hours. After washes, anti-mouse 
IgG (1/5000) was added for 1 hour, washed away* and the blot processed for chemituminescence and exposed to film. 

An autoradiogram of FDs and controls treated either with control vehicle or with 1 \M of murA as tigand was 
made with markers indicating endodimer FD or wild type receptor-binding complex band shifts. E -t-U and E+R were control 
1 0 lanes of in vitro translated proteins for sizing of endodimer band shifts. Figure 2 A is a graph that quantifies endodimer sized 
band volumes obtained from the autoradiogram. 

A prominent band co-migrating with band shifts for the separate dtmer complexes (E -t^ U and E -f- Bl was observed in 
many lanes and indicated that some of the FDs formed functional DNA-binding internal dinters that we designate 
"endodimers." UOE displayed a barely detectable endodimer band that was perceptibly increased (2-fold) in intensity by the 

1 5 presence of ligand (Figure 26). The addition of 5 linker segments in USE appeared to amplify the overall intensity of the 
shift, but still displayed minimal response to ligand. UNE constnicts displayed faint bands above the weak endodimer band- 
shift that were equally as intense. These bands were occasionally visible but were proportionally less intense than the 
endodimer band shift of other FDs described below. ROE, which is analogous to UOE but with substitution of U for R, 
formed a clear endodimer band that was increased 4-fold by ligand (Figure 2B)> and unequivocally demonstrated FD responds 

20 to hormone. R5E. with a longer linker, had slightly decreased basal, and slightly increased ligand-stimulated, EcRE binding 
15-fold) compared to ROE. The higher molecular weight shift bands observed in UNE lanes were not visible in ligand treated 
R5E lanes (Figures 2A). 

EMU constructs indicated that FDs in which E was positioned at the N*termtnus formed endodtmers and bound the 
EcRE probe an average of 10 times better than UNE constructs. ENU constructs bound probe 80- 1 50% more readily than 

25 even other FDs with high-level EcRE binding (i.e., ESR, discussed below), but displayed nearly complete insensitivity to ligand 
for formation of ONA-binding complexes. Like UNE FOs, the longer tinker length of E5U did not significantly increase the 
binding to EcRE or responsiveness to murA, relative to EOU (Figures 2A and 26). ENR constructs, like ENUs, also 
demonstrated a greater affinity for the probe than the reversed constructs (Figure 2A). Unlike ENU constructs. ENR FOs had 
a high degree of ligand dependence for endodimer formation. E5R displayed a slightly decreased shift from the rate of basal 

30 transcription, and a slightly elevated ligand-stimulated shift (1 I fold relative induction) in comparison to EOR |7 fold), much 
like ROE and R5E. Non-transfected cells, or cells transfected only with E, displayed no detectable shift even after prolonged 
exposure. At concentrations of protein higher than those used in these gel shift reactions, a shift of E in combination with 
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endogenous RXR was observed. Separate experiments also confirmed that FOs specifically bound EcREs and not unrelated 
control probes that included thyroid hormone response elements. 

Figure 3 shows relative luciferase expression of FD constructs with or without 1 ^M murA. The results of these 
studies are summarized in Table 1 below and show repression of monomeric receptors and monomeric dimer partners by EOU 
andUOE FDs. 

TABLE 1 





UOE 


ROE 


USE 


R5E 


EOU 


EOR 


ESU 


E5R 


E 


E+U 




0.75 


0.18 


0.52 


0.25 


0.17 


0.21 


0.41 


0.46 


1.08 


1.74 


♦Ug/E, 


0.55 


1.75 


0.69 


1.50 


0.40 


3.39 


0.62 


3.89 


7.72 


8.34 


Rel. Ind. 


0.7 


9.7 


2.7 


6.0 


2.4 


16.2 


1.6 


8.4 


7.1 


4.8 



•Ltg/E4 « luciferase activity in cells transiently co-transfected with FDs and monomeric receptors vs. reporter only 
without (igand 

^Ug/E4 ^luciferase activity in cells transiently co-transfected with FOs and monomeric receptors vs. reporter 
only with ligand 

Rel. Ind. - relative induction of individual receptor groups 

As shown in Table 1, UNE and ENU constructs either did not stimulate luciferase expression or appeared to 
actually function as repressors of basal transcription. EQli in the absence of ligand, for instance, reduced E4-luc 
transcription to only 1 7% of the basal expression level. RNE and ENR constructs, by comparison^ functioned much more 
like separate E+R, although both the basal and induced levels were proportionally decreased relative to the monomeric 
receptors. As might be predicted from the mobility shift experiments. E5R provided the closest profile to the monomeric 
{i.e., wild type) separate receptors, having approximately 50% of the E^R induced expression level. By contrast, both EOR 
and E5R, however, had greater relative inductions than separate E-t-R (16.2* and 8.4-fotd, respectively, versus 7.1-foid). 

EXAMPLE 4 

Addition of the potent VPt6 transacttvation (t) domain to the N-terminus of FD constructs was used to further 
examine transactivation by FDs. To test the possibility that distortion of the endodimer by a short linker {0 linker segments) 
contributed to inhibited transactivation, the number of linker segments in the linker between the units in the invention fusion 
proteins was expanded to a maximum of 20 linker segments for ENR and ENU variants. As shown in Figure 4A, addition of a 
VP16 1 domain to either the E monomer (VE) or to FDs IVENR or VENU) resulted in a 12 to 15'fold overall increase in 
luciferase expression. These VENR FDs containing linkers of variable length produce a stimulated level of transcription 
virtually identical to separate VE protein, suggesting that augmentation of ENR proteins with VP1 6 1 domain compensated 
for any loss in transactivation resulting from the fusion of the receptor and dimer partner into the invention fusion proteins. 
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Notably, however, the level of basal induced expression was increased 7 to S fold over monomeric receptors. This resulted 
in a dramatic decrease in the relative fold induction, from 27.8-fold for separate VE to 4.1 to 5.4'f old for FDs (Figure 4 A). In 
addition, the increase from a 5 amino acid linker to a 245 amino acid linker had little effect on either basal or activated 
VENR transacttvatton with the exception of a subtle decrease in both levels in the FDs having the longer 125 110 linker 
5 segments) and 245 amino acid (20 linker segment) linkers. As shown in Figure 4B, VEND constructs displayed neither the 
basal nor induced level of expression of separate VE-t-U, even though the overall level of transactivation was significantly 
increased relative to i^U complexes without a heterologous transactivating domain. Like the VENR proteins, the addition of 
increased linker segments to the VENU constructs had a minimal effect on basal or induced transcriptional activation. 

EXAMPLE S 

10 To elucidate the propensity of fusion protein partners to dimerize with each other over other monomeric suitable 

dimer partners, the influence of monomeric dimer partners on FD transactivation properties was assayed. In the transient 
transfection experiments shown in Figure 6. R and U proteins with N terminal VP16 fusions (VR and VU. respectively) were 
utilized to probe E5U and E5R promiscuity. E5R and E5U constructs were used because previous experiments suggested 
that they displayed properties that were the most similar to monomeric (i.e., wild type) receptors. When both E5R and ESU 

1 5 constructs were cotransf ected into 293 ceils in equimolar quantities along with the E4-luc reporter, VR cotransf ection was 
not observed to significantly influence either ESU or E5R function, even though VR was found to augment E mediated 
transactivation alone by > 5-fold either with or without ligand (Figure 6). VU, on the other hand* delectably interacted with 
E5R, and to a lesser extent with ESU. Ligand dependent transactivation was observed for ESU VUsp at approximately 1 0% 
of the ligand stimulated level of VUsp with E, whereas ESR^VUsp activated to nearly 50% of the VUsp * I level. VR and 

20 VU atone had no influence on E4-luc expression either with or without murA. 

EXAMPLES 

Using a modification of the method for constructing invention fusion proteins described above in Example ), fusion 
proteins were constructed having a Bombyx ecdysone receptor (BEcR) in the amino terminal half of the fusion protein, a 
linker bridge of 5 amino acids, and either RXR IBEOR) or Usp (BEOU) as the dimer partner placed at the C-terminal half of the 
25 fusion protein. To facilitate cloning, the BEcR amino acid sequence in each of these fusion proteins was augmented at the 
C terminal end with amino acids 650-878 from the Drosophila melanogaster ecdysone receptor. Similar constructs were 
made wherein a His tag was positioned at the amino terminus of the fusion protein to facilitate purification of the fusion 
protein IHisBEOR and HisBEOU, respectively). 

Gel mobility shift assays using tebufenozide and MurA as ligand were conducted as described in Example 3 above 
30 to determine whether functional dimers formed from the BEOR and BEOU fusion proteins. The results of these assays 

showed the both BEOR and BEOU dimeri2e and constitutivety bind target DNA irrespective of the presence of ligand. When 
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the study was repeated using HisOEQR, HisBEOR and HisBEOU. it was determined that the His tag does not effect the ability 
of these fusion proteins to bind target ONA. 

It will be apparent to those skilled in the art that various changes may be made in the invention without departing 
from the spirit and scope thereof, and therefore, the invention encompasses embodiments in addition to those specifically 
5 disclosed in the specification. 
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WHAT IS CLAIMED IS: 

1. A chimeric protein comprising: 
at least two functional protein units, wherein each functional protein unit comprises the dimerization 

domain of a member of the steroid/thyroid hormone nuclear receptor superfamily, and 
an optional linker interposed therebetweea 
wherein the at least two protein units form a functional entity. 

2. The chimeric protein according to claim 1 wherein the entity is an endodimer. 

3. The chimeric protein according to claim 1 wherein each protein unit comprises a ligand binding domain, an 
1 0 optional hinge domain, and an optional DN A binding domain. 

4. The chimeric protein according to claim 3 wherein the functional entity is an endodimer. 

5. The chimeric protein according to claim 1 wherein at least one member is non-mammalian. 

6. The chimeric protein according to claim 5 wherein the at least one member is from an insect species. 



15 



7. The chimeric protein according to claim 1 wherein at least one functional protein unit comprises the 
dimerization domain of an ecdysone receptor 

20 

8. The chimeric protein according to claim 7 wherein the ecdysone receptor comprises the dimerization 
domain of a Drosophila ecdysone receptor. 

9. The chimeric protein according to claim 7 wherein the ecdysone receptor comprises the dimerization 
25 domain of a Lepidoptera ecdysone receptor. 

10. The chimeric protein according to claim 7 wherein the ecdysone receptor comprises the dimerization 
domain of a Bombyx ecdysone receptor. 

^ 1 • The chimeric protein according to claim 5 wherein at least one functional protein unit comprises the 
dimerization domain of the ultraspiracle protein. 

12. The chimeric protein according to claim 1 wherein at least one member is non-mammalian. 
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13. The chimeric protein according to claim 1 wherein at least one functional protein unit comprises the 
dimerization domain of the retinoid X receptor. 

14. The chimeric protein according to claim 1 wherein the protein units are independently selected from t he 
group consisting of glucocorticoid receptors, mineralocorticoid receptors, estrogen receptors, progesterone receptors, 
androgen receptors. Vitamin D3 receptors, retinoic acid receptors, retinoid X receptors, peroxisonne proliferator activated 
receptors, thyroid hormone receptors, and steroid and xenobiotic receptors, farnesoid X receptor, pregnenolone X receptor, 
liver X receptor, and 6XR. 

15. The chimeric protein according to claim 1 wherein the linker contains from about 5 to about 245 amino 

acids. 

16. The chimeric protein according to claim 15 wherein the linker contains from about 53 to about 125 
amino acids. 

1 7. The chimeric protein according to claim 1 5 wherein the linker comprises glycine, proline, serine, alanine 
and threonine residues. 

18. The chimeric protein according to claim 1 5 wherein the chimeric protein is a chimeric protein and the 
linker comprises the amino acid sequence of SEQ 10 NO: 15. 

1 9. The chimeric protein according to claim 3 wherein one or more protein units further comprise a C-terminal 

domain. 

20. The chimeric protein according to claim 3 wherein the DNA binding domains of one or more protein units 
comprise 66 to 6B amino acids, including 9 cysteines. 

21. The chimeric protein according to claim 3 wherein the hinge domain of one or more protein units is the 
Bombyx hinge domain. 

22. The chimeric protein according to claim 1 wherein one or more protein units further comprise an 
activation domain. 

23. A polynucleotide encoding a chimeric protein according to claim 1 . 
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24. A cell containing a polynucleotide according to clainr) 23. 

25. The cell according to claim 24 wherein the cell is mammalian. 

5 26. A method for modulating the expression of an exogenous gene in a subject organism containing: 

1 ) a chimeric protein according to claim 1, and 

2) a DNA construct comprising the exogenous gene under the control of a response element with 
which the chimeric protein interacts. 

said method comprising administering to the subject an effective amount of an exogenous ligand for at 
10 least one functional unit of the chimeric protein. 

27. The method according to claim 26 wherein the chimeric protein is encoded by a DNA construct. 

28. The method according to claim 26 wherein the subject organism is a plant, an animal a fungus or a 
15 bacterium. 



29. The method according to claim 26 wherein the subject organism is mammalian. 

30. The method according to claim 28 wherein at least one of the protein units is non-mammalian. 

31 . The method according to claim 29 wherein at least one of the protein units is from an insect species. 

32. The method according to claim 26 wherein the modulation is ligand dependent repression. 



20 



25 33. A method for modulating the expression of an exogenous gene in a subject organism containing a DNA 

construct comprising the exogenous gene under the control of a response element, 

said method comprising administering to the subject an effective amount of a chimeric protein according 
to claim ], 

wherein the modulation is independent of ligand. 

30 

34. The method according to claim 33 wherein the modulation is ligand independent activation. 

35. The method according to claim 33 wherein the modulation is ligand independent repression. 
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36. A method for modulating the expression of an gene in a subject organism containing a chimeric protein 
according to claim 1 , 

said method comprising introducing to the subject an effective amount of a DNA construct comprising the 
gene under the control of a response element, 

wherein the response element is responsive to the chimeric protein and 
vtfherein the modulation is independent of ligand for the chimeric protein. 

37. The method according to claim 36 wherein the gene is exogenous. 

38. The method according to claim 36 wherein the modulation is ligand independent activation. 

39. The method according to claim 36 wherein the modulation is ligand independent repression. 

40. A method for modulating the expression of an exogenous gene in a cell containing: 

1 ) a chimeric protein according to claim 1 and 

2) a ONA construct comprising the exogenous gene under the control of a response element with 
which the chimeric protein interacts, wherein said response element controls expression of the exogenous gene, 

said method comprising administering to the cell an effective amount of an exogenous ligand for at least 
one functional unit of the chimeric protein. 

41. The method according to claim 40 wherein the modulation is ligand independent activation. 

42. The method according to claim 40 wherein the modulation is ligand independent repression. 

43. A method for modulating the expression of one or more genes in a subject organism containing an 
endogenous response element, wherein said response element controls expression of one or more genes 

said method comprising introducing a chimeric protein according to claim 1 to the subject that interacts 
with said response element, thereby modulating expression of the genels) independent of the presence of tigand for 
the chimeric protein. 

44. The method according to claim 43 wherein the chimeric protein is encoded by an inducible DNA construct 
and the modulating comprises inducing expression of the gene(s|. 

45. A method for modulating the expression of one or more genes in a subject organism containing an 
endogenous response element controlling expression of one or more genes, 
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said method comprising introducing to the subject a chimeric protein according to claim \ that interacts 
with the response element, thereby modulating expression of the geneis) dependent on the presence of endogenous 
ligand therefor. 

46. The method according to claim 45 wherein the chimeric protein is encoded by an inducible DMA construct 
and the modulating comprises inducing expression of the gene($K 

47. A method for modulating the expression of one or more genes in a subject organism containing: 

1 ) a chimeric protein according to claim 1 , and 

2) an endogenous response element controlling expression of the one or more genes, wherein the 
chimeric protein interacts with the response element, 

said method comprising introducing to the subject an exogenous ligand for the chimeric protein, thereby 
modulating expression of the gene(s) dependent on the presence of the exogenous ligand. 

48. A method for modulating the expression of one or more genes in a subject organism containing: 

1 ) a chimeric protein according to claim 1 , and 

2) an endogenous response element controlling expression of the one or more genes, wherein the 
chimeric protein interacts with the response element, 

said method comprising introducing to the subject an exogenous ligand for the chimeric protein, thereby 
modulating expression of the gene(s) dependent on the presence of the exogenous ligand. 

49. A method for modulating the expression of one or more genes in a subject organism containing: 
a chimeric protein according to claim U and 

an exogenous ligand for the chimeric protein, 

said method comprising introducing to the subject an endogenous response element controlling expression 
of the one or more genes, wherein the chimeric protein interacts with the response element, thereby modulating 
expression of the geneis) dependent on the presence of the exogenous ligand. 

50. A method for modulating the expression of one or more genes in a subject organism containing a chimeric 
protein according to claim 1, 

wherein said method comprises introducing to the subject an exogenous response element controlling 
expression of the one or more genes. 

wherein the response element interacts with the chimeric protein thereby modulating expression of the 
genets) independent of the presence of ligand for the chimeric protein. 



49 



wo 01/36447 



PCTAJS0O/4n24 



51 . A method for modutating the expression of one or more genes in a subject organism containing an 
exogenous response element controiltng expression of the one or more genes, 

said method comprising introducing to the subject a chimeric protein according to claim 1 that interacts 
with the response element, thereby modulating expression of the genels) independent of the presence of ligand for 
5 the chimeric protein. 

52. An isolated protein crystal suitable for x-ray diffraction analysis comprising a purified chimeric protein 
according to claim 1 . 

10 53. The protein crystal according to claim 52 further comprising a ligand bound to the purified chimeric 

protein so as to form a chimeric protein ligand complex. 

54. The protein crystal according to claim 53 further comprising a nucleic acid construct being a putative 
response element for the complex. 

15 

55. A set of x-ray diffraction crystal coordinates obtained by x-ray diffraction of the isolated protein crystal 
according to claim 52. 

55. A set of X ray diffraction crystal coordinates obtained by x ray diffraction of the protein crystal according 
20 to claim 54. 

57. A method for identifying a potential ligand for a member of the steroid/thyroid hormone receptor 
superf amity, said method comprising: 

creating a three-dimensional structure of a chimeric protein as defined by the x-ray diffraction 
25 coordinates according to claim 55, 

employing said three-dimensional structure to design or select the potential ligand; 
synthesizing the potential ligand: and 

contacting the potential ligand with the chimeric protein in the presence of the response element with 
which the chimeric protein interacts operativety linked to a marker gene under conditions suitable for causing 
30 expression of the marker gene to determine the ability of said potential ligand to transactivate expression of the 

marker gene. 

58. A method for identifying compounds that modulate formation of a functional entity in a cell containing: 

a chimeric protein according to claim ], and 
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a response element with which the chimeric protein interacts operatively linked to a marker 

protein, 

said nrjethod comprising contacting the cell with a test compound under conditions suitable to cause the 
chimeric protein to transactivate expression of the marker gene, and 

determining the amount of the marker protein produced as compared with the amount produced in the 
absence of the test compound, 

wherein a difference in the amount of marker gene expressed indicates a modulation of formation of the 
functional entity due to the presence of the test compound. 

59. The method according to claim 58 wherein the amount of marker protein expressed is increased, 
indicating that the test compound facilitates formation of the functional entity. 

60. The method according to claim 58 wherein the amount of marker protein expressed is decreased, 
indicating that the test compound represses formation of the functional entity. 
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